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DEVELOPMENT AND VALIDATION OF A PROJECTIVE MEASURE 
OF POWER MOTIVATION! 
JOSEPH VEROFF 


Survey Research Center, University of Michigan 


HE purpose of this study was to develop a 
valid measure of power motivation. Mc- 
Clelland and his associates have shown 
that scoring stories written in response to 
pictures can be used as a basis for measures of 
strength of the achievement (8) the 
affiliation (3, 11) motives. These investigators 
have shown that with an increase in ex- 
perimentally aroused achievement or affiliation 


and 


motivation, there was in the stories an increase 
in the frequency of certain kinds of imagery 
related to these motives. Furthermore, the 
coding categories, which can be used to reflect 
these shifts in imagery can be successfully 
employed as a basis for measuring individual 
differences in achievement and _ affiliation 
motivations. Subjects who have high scores on 
these measures are significantly different from 
subjects who have low scores in behaviors 
which are meaningfully related to the strength 
of the motives. 

Following the same general procedures of 
these studies, the experimenter anticipated 
that a parallel measure of another motive 
the power motive—could be developed. 

What is meant by motivation? 
Numerous social theorists have used a concept 
of power. In some cases, the concept is re- 
legated to the part it plays in the analysis of 
social structure. In other cases, social power 
is a dimension to be considered in the analysis 
of interpersonal relationships. In the latter 
sense, the power motive will be considered 
that disposition directing’ behavior toward 
satisfactions contingent upon the control of the 
means of influencing another person(s). In the 
phenomenal sphere of the power-motivated 
individual, he considers himself the “‘gate- 
keeper’ to certain decision-making of others. 
The means of control can be anything at all 


power 


! The research reported in this paper was submitted 
to the faculty of the University of Michigan in partial 
fulfillment of the requirements for the Ph.D. degree 
The author would like to acknowledge the assistance of 
Dr. Daniel Katz in directing this work. The project was 
carried out while the author was a Research Training 
Fellow of the Social Science Research Council. He is 
indebted to the Council for its financial aid 


that can be used to manipulate another person. 
Overt dominance strivings can be considered 
one kind of control execution, The definition of 
the power motive offered here, however, is 
meant to include more than dominance. The 
importance of this kind of social motivation 
has been elaborated by Adler (1) and Sullivan 
(12) and has been brought to bear upon some 
of the recent research in group dynamics (6). 

Before one can make these predictions about 
the relationship of power motivation to be 
havior, a means of measuring the strength of 
the motive has to be established. The design 
of this research to develop such a measuring 
instrument falls into two steps. 

First, written stories are obtained in response 
to pictures from two groups, one for whom it 
can be assumed that power motivation is 
aroused, and the other for whom it can be 
assumed the concern about power is not 
specifically aroused. A scoring system isolating 
the kind of imagery presumably reflecting 
power motivation can then be applied to 
these two sets of stories. The stories from the 
aroused group should contain more imagery in 
the categories scored for power motivation 
than from the nonaroused group. 
Differences in imagery obtained between the 
two groups provide an empirical clarification of 


those 


how power motivation is expressed in thought 
processes. 

Second, the expected differences in imagery 
produced in these two experimental! groups are 
used as the basis for a measure of individual 
differences in power motivation. Scores so 
derived should predict individual differences 
in other behaviors related to satisfying power 
motivation. 


EXPERIMENTAL DESIGN AND HYPOTHESES 
Step 1: Group Differences in Power Motivation 


For the arousal condition the experimenter 
was able to take advantage of a natural situa 
tion: the election of student the 
University of Michigan. The candidates must 
petition for their candidacy. Once having 
declared candidacy, they have a month's time 


leaders at 
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for campaigning before election 
After 
conyregate at the polis to see 
ing has decided 


the nave 


actual 
candidates 


what the ballot 


two days ot bal ny, the 


At least two hours elapse alter 


congregated before any official 


announced. It was therefore de 


this 


candidates 


return ire 


employ waiting period for the 


cided to | 


tudy. A 


other opponent were previously asked by mail 


who had at least one 


to cooper ite in a research project designed to 


get at important social variables. The only 


information vas that the experimenter 


vyiven 
them as leaders, 


the 


was interested in campus 


efiects of 


Because 


, 
social 


the 


wa interestec in 


situatior upon imagination 


candidate were presumably extremely 


ibout the 


con 
control of the means ol 
the 


requirements tor 


cerned 


influence being elected to office situa 


tion was thought to meet the 


of the motive as it was defined 
not all the 


bias 


nowever:r, 


arousal 
ince candidates participated, 


volunteering may have been introduced 


Such bia would seem irrelevant to 
vestigation Al) 
Ss who appeared were used. No differentiatian 
the kind of 


since 


the problem under in 34 male 


within tl group was made on 


h they were candidates, 


rite ted omnces 
ty of 


fluc ncing others 
lo heighten the 


involved the re 


ponsibi making decisions and in 


relevance of the cues of the 


junting situation to the story-writing 


procedure, the candidates were asked to judge 


their chances of winning the election, on the 


following scale: 0/5, 1/5, 2/5, 3/5, 4/5, 5/5 


chance The whole procedure was done 


anonytl 
nonaroused condition were 34 
course in 


The 


whic h 


in undergraduate 
University of Michigan 
procedure introduced as one about 
interested in 
After the 
asked to sign their 


it further measure 


sone ur iu ] nts 


were 


obtaining some normative data 


procedu } were 
name could be 


their story-writing perform 


re isonable 0 


the 


this 
ibout 


eemed regard 


ne lor whom con 


erm 


means of influence was not 


iv iged 


mav argue that differences in respects 


other than aroused power motivation underlie 


such differences between these t 


Vo groups as 


may be obtained in their stories. Perhaps 


Ss in the aroused group have had peculiar 
them to 
reasons 


learning experiences predisposing 
for 
having nothing to do with power motivation as 
The validity of this 


granted; there is no means of isolating this 


write power-related imagery 


defined argument is 
factor in the present design. Greater reliance is 
placed on the second step in the design 

predicting power-motivated behavior from a 
the first step—for 
tentative validity of the 
This problem 
will be more fully treated in discussion of the 


derived from 
the 


power 


measure 
establishing 
measure ol motivation 
results 

Story-wriling procedure. All Ss wrote stories 
for five pictures in a manner outlined in The 
Motive (8 
difference is that the pictures were reproduced 


i chievement The only procedural 
in booklets rather than projected on a screen, 
and instructions modified accordingly. The 
five pictures are listed below in the order which 
they were presented: 

1. Two men in a library; Picture E in the 
Achievement series (8 

2. Instructor in classroom; Picture 21 in 
Birney’s study (5 

3. Group scene; Picture E in the Affiliation 
series (3). 

4. Two men on campus; Picture 41 in 
Birney’s study (5 

5. Political scene; from magazine, W oman’s 


Day 


Step 2: Individual Differences in Power Motiva- 
lion 


Since the stories of the nonaroused Ss could 
be identified, other measures possibly related 
to a derived score of power motivation were 
Allport-Vernon 
Vocational 


obtained in this group: (a 
Scale of Values (2), (6 
Inventory (7), ( 

(d) 


behavior 


Goals 
questionnaire on siblings, 
classroom 
Inventory 


ratings by instructors of 


rhe 
required Ss to rank-order ten listed possible 


and 
Vocational Goals 
satisfactions on a job: being leader, interesting 
job, recognition from fellow men, being boss, 
surety of keeping iob, self expression, high pay, 
fame, helping others, working on one’s own 
The asked Ss to list 
their permit re 


sibling questionnaire 


siblings and their ages, to 
birth 
instructors pertained to the following class 
the 


likes to argue and contradict; (6 


construction of order. The ratings by 


room behavior: (a) Argumentative in 


classroom 








\ PROJECTIV! 


Eager to get points across in the classroom in 
order to convince either the instructor or other 
students of a point of view. They were made 
on a three-point scale of frequency: 1 
* and 3 


hypotheses were 


“very 


frequently,” at times,’ “never 


2 
or rarely.” The following 
advanced concerning the relationship between 
individual differences in power-motivation 
scores derived from the projective instrument 
and the variables measured by the preceding 
measures 

Hypothesis 1. Power motivation is positively 
and 


vaiue 


Political Economic, and 


social 


related to 
onentation, as 
Scale of 


negatively to 
the 
A person highly 


measured by Allport-Vernon 


interested in con 


Values 


trolling the means of influence should value 


those ideas having to do with such control 


within the culture-—namely, the political and 


economic values. Being interested in con 


trolling person in a sense negates his sym 
pathy for this person’s feelings, and therefore 
the Social 
dimension was anti ipated 

Hy pothe 


related to the intensity of interests in the job 


negative relationship with the 


Power motivation 1s positively 


satisfactions of being boss and being leader 
These job satisfactions can be interpreted as 
positions in which the potentiality for con 
trolling the means of influence is high 

Hy pothe 


associated with the number of siblings in. the 


3. Power motivation is positively 


S’s family. Furthermore an association was 


hypothe sized between birth order and strength 


ol power motivation, though no prediction 
about the direction of the association is made 
Power satisfactions are perhaps derived from 
many experiences in which being manipulated 
others 


that the 


were If 


high 


by others and manipulating 


evidence. One would expect 
power-motivated individuals would come from 
larger tamiulies where this type ol experience 
is more likely to occur. Birth order would seem 
to be an important variable in further shaping 
these experiences 

Hypothesis 4 


witt 


Power motivation is positively 


iated instructors’ ratings of argu 


assiUt 
mentativeness and trying to convince others 
These behaviors are presumably instrumental 


in satisiving the power motive in the classroom 


SCORING STORIES FOR POWER MOTIVATION 


The method of scoring the stories 


parallel the pattern esta! 


MEASURE 


OF PowER MOTIVATION 


Phe 


se que nce 


(4), achievement (&), and affiliation (3) motives 


follow in a_ behavioral 


with 


categories scored 


analysis which is discussed some detail in The 


ichievemeni M otive (8 
In scoring a story it is first identified as being related 
to power motivation or not If a scorer decides that the 


story is related to power motivation (i.e., the story 


meets the Power Imagery criteria given below), it is 


then further scored for the presence or absence of the 
categories in the behavioral sequence. If the scorer de 
cides that the story is not related to power motivation 
The Power 
A complete 
13) 


contains a 


(Unrelated), no further scori ig 8 done 


Imagery criteria are given in detail below 
scoring system with examples is available 
when a story 


Power Imagery is scored 


feelings, or actions of one of 
that 


of the means of influencing 


reference to the thoughts 
the 
concerned with the control 


characters which indicates the character is 


irom 
affect; (b) 


a person. Evidence of concern can come one or 


more of three source a) statements ol 


statements of control activity; (c) statements of 


I he s€ 


supe 


rior-subordinate role relations three criteria are 


elaborated below 

a. There is some statement of affect surrounding the 
maintenance or attainment of the control of the means 
A character can be feeling good 


because he 


of influencing a person 
about winning an argument or feeling bad 
his Also 


or showing dominance 


was unable to have way statements about 


wanting to win a point wanting 
to gain control (such as by a political or executive posi 
tion), convince someone ol! son ething or put a point 
statements ol 


the 


can be interpreted as implicit 
about the 


Affective concern .can also be found in state 


across 


affective concern control of means of 


influence 
ments of wanting to avoid weakness. Examples of this 
are: being humiliated in a status position, being 
ashamed of an incapacity to assert one’s self or become 
resenting the influence of another and 
this 


statement 


dominant, 
wanting to overcome 
of concern over control 
Statements of 


Some very weak 
of the 


desires to teach another person something, to interest 


means of influence are scored 


another person in something, to inspire someone 


although apparently weak in obvious power signifi 


times when these 
the 
the 


cance should be scored. The only 


statements would not be scored are in cases where 


teaching, inspiring, or interesting is solicited by 
person being influenced 

Statements of a person wanting another person to 
gain control of the means of influence--such as a person 


wanting someone else to win an electior should not be 
Che only exception is a case 
the 


kind of 


cored as Power Imagery 


in which there is a clear identification of 
with the rhis 


may appear in election stories where John 


very 


person preferred candidate 
imagery 
wants to win the election, although John is not the 
candidate 

b. There is doing 


something about maintaining or attaining the control 


a definite statement of someone 


of the means of influence. Something that the character 


is actually doing in the story is the only kind of imagery 
that can qualify as Power Imagery under this criterion. 


rhe character has to dispute a position, argue some 


thing, demand or force something, try to put a point 


cro try to convince someone ofl omething and 
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theoretically any activity in order to obtain control of 
the means of influencing someone) 

Three special considerations should be noted under 
The two specia! considerations noted under 
Statements 


Criterion 6 
Criterion a are applicable te Criterion 6 


nterest, 


involving trying to teach, 
are scorable under Criterion b. Trying to win an election 


or inspire someone 


scorable unless there is a 
the campaigning 
Sometimes, moreover, it 
although it meets the 


for someone elise will not be 


close identification between person 
and the candidate for office 
will be clear that an activity 
criterion listed in the preceding paragraph, is only for 
the purpose of arriving at the goal of another motive 


the 


In such a story should not be scored for 


Power Imagery 
c. A story can be scored for Power Imagery if there 


case, 


is a statement of an interpersonal relationship which in 
its execution is culturally defined as one in which there 
is a superior person in control of the means of influencing 
a subordinate person. Examples of this are: boss-worker, 
judge-defendant. Mere mention of superior-subordinate 
relationship is not enough to score. There has to be 
rABLE 1 
NUMBER OF SUBJECTS IN AROUSED AND NONAROUSED 
Grours Apove AND BeLow MeEpIANn* For 
Various PowerR-KELATED [MAGIN 


ATIVE CATEGORIES 
Aroused Non 
aroused 
Group Group 
(News (N = ¥) 
Imaginative Category 23 
% 4 4 s 
4 v Sb rv 
< -_ < ~@ 
Power Imagery 27 71:18) 19 0023 


Breaking point 
between 1 and 2 


Need 21 13 14 20 047 
Instrumental Activity 
ositive 22 12 15 19 047 
Negative 12 | 21 8 | 26 090 
Questionable 16} 18 | 13 | 21 | ne 
Any sign 24110) 13 | 21 0056 
Break ing point 
between | and 2 
Goal Anticipation 
Positive 5 $1 1 33 | ns 
Negative ) $1 5 s1 | ns 
Any sign 6 | 28 5 $1 n.s 
Block 
Personal 4 WwW l 33 | ns 
Environmental ee) 2 32) ns 
Affective State 
Positive 4 0) ] 33 n.s 
Negative $31 2; 32 {ns 
Any sign 6 | 28 $/ 31 ins 
Thema 28 6/17 17 0044 


* Median based on combined frequencies of both the aroused 
and the nonaroused groups 

t Except where indicated 58 were divided as to whether the 
this 


category appeared once or not at all in the five stories 


procedure approsimating the combined median most closely 
t All the probabilities were 


hypothesis, 1 4/ 


computed by Exact Test, one-tailed 


some mention of the activity involved in carrying out 
this relationship. Mere mention of a given person being 
influential would be enough to allow this story to be 
scored for Power Imagery 

Unrelated Imagery is scored when there is no evidence 
of power motivation concern in the stories as defined 
under Power Imagery 

When the story has been identified as one containing 
Power Imagery, it is scored for the presence or absence 
of the subcategories similar to those outlined for 
achievement motivation (8): Need; Instrumental Ac- 
tivity; Goal Anticipation; Block; Affective State; 
Thema 

In the final coding the coder did not know whether a 
given story came from one or the other of the two 
groups. To check on the reliability of this coding, a 
second coder scored a random set of twenty-four Ss 
from the total sample. The percentage of agreement for 
a given category between the two scorers ranged irom 
57 to 97. Furthermore, need for power scores (n Power 
scores) were tallied for each S by adding the number of 
categories appearing in his stories. The n Power scores 
obtained from the two coders were rank-ordered for 
The rank-order correlation (rho) be 


these Ss by the 


the subsample 
tween the ranks obtained on 
judges was + .87 


two 


RESULTS 
Step 1: Group Differences in Power Motivation 


Imagery in the stories written by the 
aroused and nonaroused groups was compared 
by dichotomizing the frequency distributions 
of the combined groups as near to the median 
as possible. Table 1 gives for each category the 
number of Ss in each group above and below 
the median of the distribution 
Exact tests were computed for each instance 
that potentially 
deviation from a chance distribution. 
Imagery, Need, Instrumental Activity, and 


combined 


significant 
Power 


represented a 


Thema appeared significantly more often in 
the aroused group’s stories than in the non- 
incl 


aroused group’s stories. These results, 


TABLE 2 
NcMBER OF SUBJECTS IN AROUSED AND NONAROUSED 
Groups ABOVE AND BeELow Mepran* 
or N Power Scores 


n Power Score 


Group 
Above 6 Below 6 
Aroused group (N = 34) 22 12 
Nonaroused group (NV = 34 9 25 
Note.—g = .0013, one-tailed hypothesis, 1 d/ 
* The approximation to the median was computed by combin 


ing the n Power scores of both the aroused and the nonaroused 


groups 
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TABLE 3 


FREQUENCIES OF Hicu-Low n Power Scorers rin Various CATEGORIES ON MEASURES 
RELATED TO Power MOTIVATION 


Measure Dimension 
Allport-Vernon Political 
Scale of Values High* 
Low 
Economi 
High 
Low 
Social 
High 
Low 
Vocational “A job where you could 
Goals be leader”’ 
Inventory High 
Low 
“A job where you could 
be boss’’ 
High 
Low 
“A job where you could 
be looked upon very 
highly by fellow 


men” 
High 
Low 
Sibling No. of Siblings 
Questionnaire 0 
l 
2 
3 or more 
Has older sibling 
Yes 
No 


Argumentativeness 
Frequency 
ligh 
Medium 
Low 
Frequency of Trying to 
Convince Others 
High 
Medium 
Low 


Instructor’s Ratings 


High n Power Low n Power 


~Is 


— 


9 
7 

? 
Z 


Chi square p df 
5 
8 - ns — 
5 
8 n.s - 
9 
4 - 12 1 
4 
10 .096t 1 
8 
6 ae os 
3 — .049 1 
11 (2 tailed hypothesis) f 
1 
4 
3 n.s -- 
3 
5 
6 0.8 — 
1 
7 5.62§ <.05 2 
7 
] 
6 11.02 <.001 2 
8 


High and Low division on Allport-Vernon Scale of Values based on deviation from normative data (2) 


t The probabilities of the 2 K 2 tables were computed by the Exact Test 


5 
dentally, are comparable to those obtained in 
similar comparisons in respect to the achieve 
ment and affiliation motives, except that the 
categories Goal Anticipation, Block, and 
Affect failed to differentiate the groups 

An over-all check on the differences in 
imagery can be made by comparing the n 
Power scores for the Ss in the two groups 
Table 2 shows the number of Ss in each group 
above and below the combined groups’ median 
n Power score. The probability of obtaining 
this distribution by chance is .0013, again as 
estimated by an Exact Test 


All other probabilities were estimated under a one-tailed hypothesis 
Chi squares where reported were estimated by the Mood Maximum Likelihood Ratio (9) 


Step 2: Individual Differences in Power Motiva- 
lion 


Scores of the nonaroused group for n 
Power were dichotomized as near as possible 
to the group median. The resulting division 
into High and Low n Power scores is retained 
through all comparisons to be made in this 
section. The results of the comparisons made 
between the n Power scores and the variables 
for which relationships were predicted appear 
in Table 3. In most cases the other variables 
are dichotomized near the median into high 


and low categories 
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This table supports the following observa 


tions which confirm predicted hypotheses. 


High n Power scorers, as compared to low n 
have lower scores on 


Power tend to 


the Social Value Dimension of the Allport 


scorers 


Vernon Scale of Values; show a significantly 


stronger interest in the job satisfaction of 
being leader; were rated significantly higher 
by their instructors on the frequency of 


argumentation and the frequency of trying to 
convince others of their points of view in the 
classroom 

Other hypotheses were not confirmed. High 
n Power scorers as compared to low n Power 
did not their 


scores on the value orientations Economic and 


scorers differ significantly in 
Political, on their ratings of the job satisfa 


tion of being boss, and on the number or 


order of siblings in their families 

lable 3 also includes an unpredicted result, 
that 
satisfaction of obtaining recognition from their 


high n Power scorers rated the job 


fellows significantly higher than low n Power 
the 


Goals 


scorers. No other comparison between 


the Vo ational 
Inventory yielded any positive results 


motive measure and 


[DISCUSSION 


Phe results obtained comparing the imagery 


produced by the aroused and nonaroused 


groups are similar to parallel results found 
with the measures of achievement and affilia 
tion motivations. In the present design how 
ever there is an ambiguity in the nature of the 
arousal condition which was not present in the 
the fact that 


power arousal group was different from the 


earlier studies. This ambiguity 
the 


nonaroused group not only in presumed level 


of arousal of some motivational pattern but 
also in whatever dispositions are salient in 


discriminating candidates and noncandidates 


forces one to make only 


lor college othces 


tentative conclusions about these results. It 


may be that the group differences uncovered 
in the analysis of the stories are due to motiva 
tions which are distinct from the power motive 
as defined 

I'wo other reasonable interpretations of the 
between the two groups 


the 


obtained difference 


ittribute them to differences in level of 
} 


achievement or Fortu 
was possible to check the former 


recognition motivation 


nately it 


p yssibility The two sets of stories were scored 


for achievement motivation as outlined in 


The Motive (8 
difference in mean n Achievement scores of the 


No significant 


Achievement 


groups was found 


The other interpretation, that the group 
differences are accountable in terms of recog- 
nition rather than power motivation, is a 


distinct possibility. The only basis for thinking 
otherwise is that the scoring system was built 
around a conceptualization of power motiva- 
tion as distinct from strivings for recognition. 
Recognition themes in stories were not scored 
unless they fulfilled the criteria for power 
motivation as well. That is, if a story brought 
up problems of recognition as a way of obtain- 
ing control of the means of influence, then 
and only then was it scored. There is, however, 
no experimental way in the present study to 
rule out the interpretation of the power- 
motive measure in terms of recognition. In 
further use of the measure of power motiva 
tion caution to recognize this possible con- 
tamination is recommended 

There seemed to be no difference between 
the aroused and nonaroused groups on certain 
of the affected categories (Goal Anticipation, 
Affect, and Obstacle), which proved valuable 
in discriminating groups with the other 
measures. An explanation for this discrepancy 
offered. Perhaps this particular 
motive the 
sociated affective thoughts. 
good cause to inhibit the expression of affect 


should be 


does not permit arousal of as- 


There might be 


surrounding the power area, since there are 
many recognized taboos for expressing power 
the 
from 


in this culture—especially expression of 


overt satisfactions derived power be 
havior. Support for this explanation comes 
from the that 
appeared very infrequently in the stories of 
bott Another 
differential appearance of 


might be that the particular pictures used in 


observation these categories 


groups reason for the lack of 
these categories 
this study did not provide cues to the kind of 
thinking that reflect the affective 
component. This possibility can be tested in 


would 


further research 

The fact that the measure not only reflects 
the group differences in assumed motivation 
level, but also individual differences in varia 
bles related to aspects of power concern as 
defined, suggests that the instrument can be 
used with success in researches in which such a 
measure of motivation would be needed. The 


measure of power motivation, if some of its 
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ambiguities are clarified, can be potentially 
used for exploring dimensions of control be- 
havior exhibited in interpersonal relationships. 
Researchers using concepts of power different 
from the one presented here, should be 
cautious in the use of this measure. 

Some of the hypothesized relationships, 
particularly with the Allport-Vernon scales, 
were not substantiated in this study. These 
value measures can perhaps reflect many 
different motivational orientations, and it may 
thus be naive to expect a positive relationship 
between a fairly specific motivational pattern 
and values that can service many different 
needs 

Lack of support for the prediction that Ss 
high in n Power would rate the job satisfaction 
of being boss higher than Ss low in n Power 
may indicate that re-evaluation of the meaning 
of being boss for them is in order. ‘‘Boss”’ can be 
thought of as a negative word in this culture, 
and, indeed, there was little variability in the 
Ss’ ranking of job satisfaction in this respect 
Nearly everyone ranked this alternative low 

rhe simple prediction that high n Power Ss 
would come from large families was not 
substantiated. This finding would seem to 
indicate that any relationship between the 
power motive and the family structure de 
pends on more complex considerations than 
simple frequency of contact with other family 
members. When the distribution of n Power 
score is divided into thirds (high, medium, and 
ow), a trend was uncovered. Eight out of 
twelve high n Power scorers had at least one 
older sibling whereas this was true for only one 
out of six medium n Power scorers and five out 
of nine low n Power scorers. (Chi square 
4 34. p « 15. 2 df. two-tailed hypot 1€S18 
Perhaps some fruitful distinctions can be made 
in the area of family structure which may be 
useful in understanding the development of 
the power motive 

The finding that high n Power Ss rated the 
job satisfaction of gaining recognition from 
their fellow men higher than do low n Power 
Ss points to the possibility that the satisfac 
tions olf prestige may be intimately tied with 
the satisfactions of power rhis point of view 
has been elaborated by Murphy (9). There is 
also the possibility that the measure includes, 


} 


least partially, some aspects of recognition 


‘ 


al 


motivation Indeed. all of the Significant 


findings can be theoretically accounted for by 


considering the motive measured to be the 
rec ognition motive. 

The relationship between achievement and 
power motivation. The product-moment cor 
relation between the n Achievement and n 
power scores was +.27 (p < .05). An attempt 
was made to use the n Achievement scores, 
however, to predict the same variables with 
which n Power scores had been related. No 
significant findings with these variables were 
obtained. In short, there seems to be a slight 
positive association between the two motives, 
but what is measured in scoring for the power 
motive is that process which seems to be 
directly related to power, rather than achieve 
ment strivings 


SUMMARY 


['wo groups of Ss wrote stories in response 
to five pictures. One was tentatively con 
sidered to be aroused by cues relevant to power 
motivation; the other was not aroused. A 
scoring system paralleling the coding methods 
which had been fruitful in measuring the 
achievement and afhliation motives was ap 
plied to the stories, and succeeded in differ 
entiating the two groups. Four of the seven 
categories coded appeared significantly more 
frequently in the stories of the aroused group. 

Individual differences in power motivation 
were estimated for Ss in the nonaroused group 
by the n Power scores (based on the coding 
categories). High n Power scorers differed 
from low n Power scorers in this group on a 
number of other factors relevant to power 
motivation. Some hypotheses relating strength 
of power motivation to other variables were 
not confirmed 

Although there was a significant positive 
correlation between n Power scores and n 
Achievement scores, the n Achievement score 
distribution does not predict the same rela 
tionships with power-related variables 

In conclusion, the results indicate that the 
projective measure of power motivation de 
veloped here not only can successfully isolate 
the presumed differential level of motivation 
in groups, but also can successfully predict 
ittitudes and overt behavior which are pre 
sumably related to the processes involved in 
power motivation. It is cautioned, however, 
that the measure may be conceived also in 


terms o! recognition motivation 
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GSR REACTIVITY AS A FUNCTION OF ANXIETY, 
INSTRUCTIONS, AND SEX 
JAMES L. BERRY ann BARCLAY MARTIN 


University of Wisconsin 


EVERAL studies indicate that GSR condi- 
tioning is positively related to the level 
of anxiety of the subject. Bitterman and 

Holtzman (4) found such a relationship when 
an over-all index of anxiety was assessed on the 
basis of several behavioral ratings and test 
scores. Welch and Kubis (10) found that psy 
chiatric patients diagnosed as anxious showed 
more rapid GSR conditioning than a group of 
normals. Beam (3) found GSR reactivity to in- 
crease immediately before anxiety-arousing 
situations, such as oral examinations, as com- 
pared to a measure taken at another time 
However, results have been rather uniformly 
negative in terms of a relationship between 
anxiety and GSR conditioning when the Taylor 
Anxiety Scale has been the sole measure of 
’ Neither Bitterman and Holtz- 
man nor Beam found any relation between 
GSR conditioning and the Taylor Anxiety 
Scale 
The present study was designed to test the 
hypotheses that the Test Anxiety 
Scale and the kind of instructions given to the 
Ss are related to GSR conditioning. The Test 
Anxiety Scale developed by Sarason (6) was 


anxiety level 


Sarason 


tried in this study because of the previous fail 
ures of the Taylor Scale to be predictive in this 
area, ar i because it was thought on an a priori 
basis that the nature of the Sarason Test was 
such as to give a more valid measure of anxiety 


proneness among college students. The Test 


Anxiety Scale requires S to indicate on a series 
of five-point scales how he feels right before 
and during examinations in college courses, 
group intelligence tests, and individual intelli 
gence tests 

Three different kinds of 
given with the intent of varying experimentally 
the level of apprehension that the Ss would 


instructions were 


have about the conditioning situation. Taylor 
(9) varied instructions in a conditioned eye- 
blink study by telling one group that the air 
puff would increase in intensity as the trials 
progressed and telling another group that it 
would decrease. She found no differences be 
tween the groups. No previous attempts to 





modify GSR conditioning by instructions were 
discovered. 
METHOD 


From a class of 330 introductory psychology stu- 
dents, extreme groups of 60 high-anxiety (30 male and 
30 female) and 60 low-anxiety (30 male and 30 female) 
Ss were selected. The Ss from each of these extreme 
groups of the same sex were then placed rancomly in 
one of the three instructions groups. This procedure 
provided an anxiety-by-instructions-by-sex factorial 
design with 10 Ss in each of the twelve groups 

A modified form of the Sarason Test Anxiety Scale,' 
with items referring to individual intelligence tests 
omitted, had been administered to the class as a group 
Answers were recorded on regular five-point IBM 
answers sheets. The Ss were then seen individually for 
the second session 

The Apparatus used to measure GSRs was an AC 
circuit designed by Grant (5). Dry silver finger elec 
trodes were held on the palmar surfaces of twe adjacent 
fingers on the right hand by rubber straps. Shock was 
supplied by a vacuum tube circuit which provided a 
constant current of .30 ma. to the left arm of the S. An 
audio-oscillator provided a 4,000 cycle tone. Cams 
attached to a one-cycle-per-minute motor were con- 
structed so that a tone of 5 seconds’ duration followed 
immediately by a shock of 0.5 second’s duration could 
be presented every 30 seconds. The shock could be 
eliminated when desired by a switch 

lhe instructions given to the three different groups 
were as follows 

Group 1, Apprehension Arousal. Throughout the 
course of this experiment you will receive several 
electric shocks, such as the following (2 sample 
shocks). At one point in the experiment you will 
receive a shock considerably stronger, approximately 
twice as strong as the ones you just experienced 
We realize that the shock is already strong and, 
because increasing it too much might have bad 
eflects, you will receive only one of the stronger ones 
This one may occur at any time during the course 
of the experiment. Do you have any questions before 
we begin? 

Group 2, Neutral. Throughout the course of this 
experiment you will receive several electric shocks, 
such as the following (2 sample shocks). Do you have 
any questions before we begin? 

Group 3, Reassurance. Throughout the course of 
this experiment you will receive several electric 
shocks, such as the following (2 sample shocks). These 
may seem rather strong to you now but you will find 
that you will adapt to them rather quickly and, after 
a few more, it will begin to feel like a pricking or 


* We are grateful to Dr. Sarason for supplying us with 
a copy of the Test Anxiety Scale 
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tickling sensation. Most subjects haven’t been 
bothered by them. Do you have any questions 


before we 
Each 


conditioning 


begin? 
S was then given identical treatment in the 
minutes of relaxation 


situation. Four 


were followed by five trials with the tone alone, after 
which 10 conditioning trials and 20 subsequent extinc 
tion 
schedule was used during conditioning in which no 


and 9. Skin resistance 


rials were administered. A partial reinforcement 


shock was given on trials, 3, 5, 
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TABLE 1 


SUMMARY OF THE ANALYSIS OF VARIANCE OF 
CONDITIONING TRIALS 





| | 

Source “f igaare| 7 | # 
Total 119 | 
Within Groups 108 | 7.89) 
Sex 1 |103.9 |13.2 | .O1 
Anxiety 1 |, 29| + .04 
Instructions 2) 14.8] .93) 
Sex X Anxiety 1 | 23.81) 3.01} .10 
Sex X Instructions 2 | 26.85) 3.40) .05 
Anxiety X Instructions 2 4.44 56) 
Sex X Anxiety X Instruc- 2 1.92 37 


tions 


readings were taken immediately before the tone and 
of the lowest point reached during the tone for all 
trials 


RESULTS 


The curves for adaptation, conditioning, and 
extinction are shown in Figs. 1-3. Resistance 
was transformed to log conductance — 1 to 
minimize the GSR and 
base level that is found when resistance is used 
measure, and trials were combined in 
pairs for conditioning and extinction curves. A 


correlation between 


as the 


summary of the analysis of variance for re- 
peated measures for the average response dur- 
ing the conditioning trials is shown in Table 1. 
Bartlett’s test for homogeneity of variance was 
nonsignificant. The F values for sex and the in- 
teraction of sex and instructions were signifi- 
cant at the .01 and .05 levels respectively. The 
interaction of sex and anxiety did not reach 
Statistical significance. 

In order to investigate the possibility of con- 
sistent trends as a function of trials, an Aiex- 
ander trend analysis (1) was performed. The 
over-all group slope was not significantly dif- 
ferent from zero, nor were the subgroups sig- 
nificantly different from each other in slope. 
It is apparent from Fig. 1 that most of the in- 
reactivity the conditioning 
trials as compared with the adaptation trials 
occurred during Trials 1 and 2. 

The extinction data were analyzed by com- 


creased during 


puting the average response of each S over the 
20 extinction trials, and then performing an 
analysis of variance upon these scores. Table 2 
summarizes this analysis, and indicates that 
only the between-sex variance was significant. 

*In order to make the figures easier to read, the 


transformation was actually as follows: 100 (log con 
ductance —1 





id 
all 
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TABLE 2 


SUMMARY OF THE ANALYSIS OF VARIANCE 
oF EXTINCTION TRIALS 


Source 4 \gquare) ? | ? 
Tota! 119 
Within groups 108 | 99.1 
Sex 1 |502.7|5.07| .05 
Anxiety 1 |150.511.52 
Instructions 2} 31.4) .32 
Sex X Anxiety 1 |254.6)2.57 
Sex < Instructions 2 |253.7|2.56 
Anxiety < Instructions 2} 57.4 .58 
Sex X Anxiety X Instruc 2 | 16.6) .16 


tions 


The sex difference was found to hold during 
adaptation also. The average of Trials 4 and 5 
of tone alone was found to be significantly 
higher for the female group (F 14.1, p « 
001 

The results are more clearly visualized in the 
curves. A comparison of the male and female 
curves show a marked tendency for females to 
have greater GSRs during all phases of the ex 
periment. The interaction of sex by instruc 
tions is apparent when Fig. 2 and 3 are com 
pared. For the males, the Reassurance instruc 
tion group is considerably below both of the 
other instruction groups. For the females, the 
Apprehension Arousal group is the lowest of 
the three. Thus, there is a tendency for the in 
structions to have opposite effects on males and 
females. This tendency was further analyzed 
within each sex group, using the average re 
sponse during conditioning as the measure 
he between instructions variance was signifi- 
cant for the female groups (F 3.59, p < .05). 
The between-instructions variance for the 
male groups, however, was not significant 
(Ff 2.44, p < .10). It is apparent from the 
curves that the Apprehension Arousal and 
Neutral male groups are very similar in their 
conditionability and that the Reassurance 
group falls below these groups. Since it was 
expected that the average response during ccn- 
ditioning would be less for the Reassurance 
group, it was considered appropriate to com 
pute the probability of this occurrence. This 
was done by the use of Whitney’s extension of 
the U statistic as described by Mosteller and 
Bush (7). The probability of the Reassurance 
group’s being lower than both the other groups 
by chance alone was less than .01 


Although the term “conditioning” has been 
used throughout this study, it is probable that 
much of the variance may be the result of indi 
vidual differences in general GSR reactivity, 
therefore reflecting pseudo-conditioning to 
some extent. That this is probably the case is 
suggested by other GSR data recently ob 
tained in the Wisconsin laboratory on an N of 
80 Ss in which the average GSR reac tivity dur- 
ing trials with tone alone correlated .74 with 
average response during conditioning and .64 
with averag¢ response during extinction. The 
possibility that pseudo-conditioning is involved 
of course does not make possible relationships 
between GSR reactivity and anxiety or instruc 
tions any less meaningful. 

The marked sex differences in GSR condi- 
tioning found in this study are consistent with 
a similar finding with eye-blink conditioning 
reported by Spence and Farber (8). Females 
gave more conditioned responses (p < .10) 
than males. Similarly, Aronfreed, Messick, and 
Diggory (2) found greater GSR reactivity in 
females than males. Whether females are con- 
stitutionally more reactive in GSRs or more 
apprehensive in situations of this sort obviously 
cannot be determined from these results. 

The findings seem to support the hypothesis 
that instructions affect the extent of GSR con 
ditioning. Reassuring instructions to the male 
Ss resulted in lessened conditionability. The 
significant interaction of sex and instructions is 
puzzling, and suggests that whereas the male 
groups are affected in the predicted and com 
mon-sense direction by instructions, the female 
groups are affected oppositely, Apprehension 
Arousal instructions producing a decrease in 
conditionability. A similarly puzzling tend- 
ency, although not significant, was for the fe 
male high-anxiety group to show less condi 
tioning than the low-anxiety group. It is per 
haps best to withold speculation about this 
contrariness of females until such an unex 
pected finding can be cross validated 

he Sarason Test Anxiety Scale did not 
prove to be significantly related to GSR condi 
tioning. Along with the previous failures of the 
Taylor Anxiety Scale to be related to GSR con 
ditioning, this finding raises the question of 
whether any set of paper-and-pencil items is 
related to GSR conditioning. This question is 
being presently pursued by empirical item 


analysis 
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SUMMARY 


GSR conditioning and extinction measures 
were obtained on 120 college Ss in a factorial 
experimental design in which twelve treatment 
groups were divided into two sex groups, three 
instruction groups, and high- and low-anxiety 
The 
given three different kinds of instructions de- 


groups three instruction groups were 


signed to be (a) apprehension arousing, (6) 
neutral, and (c) reassuring. The high- and low 
anxiety groups were selected as extreme groups 
on the basis of the Sarason Test Anxiety Scale 
Results indicated that the instructions had 
opposite effects on the conditionability of males 
less 


and females. The male Ss tended to show 


conditioning when given reassuring instruc 


tions than when given either neutral or appre 
hension arousing instructions. The female ‘Ss 
tended to show the least conditioning with the 
apprehension-arousing instructions and most 
conditioning with the reassuring instructions. 

rhere was no relationship between GSR con- 
ditioning and the Sarason Test Anxiety Scale. 
Females showed greater GSR reac tivity than 
males in all phases of the experiment; adapta 


tion, conditioning, and extinction. 


6 
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THE RELATIONSHIPS AMONG IMAGINATIVE, DIRECT VERBAL, 
AND PHYSIOLOGICAL MEASURES OF ANXIETY IN AN 
ACHIEVEMENT SITUATION! 
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EVERAL independent attempts have been 
made to relate different measures of 
anxiety (or fear) to performance in com- 

petitive achievement situations. In one series 
of studies (3, 11, 12, 15), anxiety level has been 
assessed by the Manifest Anxiety Scale (MAS), 
composed of refined MMPI items which have 
been classified by clinical psychologists as 
being indicators of general manifest anxiety. 
The experimental results indicate that sub- 
jects who score high in manifest anxiety condi- 
tion faster in a classical type of experiment 
than low-scoring subjects, but show poorer 
performance in more complicated learning 
Situations. 

Mandler and Sarason (7) have developed a 
Test Anxiety Questionnaire (TAQ) which indi- 
cates the extent to which a subject experiences 
anxiety before, during, and after three typical 
kinds of testing situations, thus having more 
specific situational reference than the MAS. 
Using the TAQ, Sarason, Mandler and Craig- 
hill (8) obtained results indicating that as the 
expectation of failure is increased, the high- 
anxiety subjects show a decrement in per- 
formance, whereas the low-anxiety subjects 
show an increment. 

McClelland ef al. (6) placed subjects in situ- 
ations differing in aroused motivation to 
achieve and asked them afterward to write 
stories to TAT-like pictures. From these 
stories a scoring system was developed to cap- 
ture the thought content reflecting the degree 
to which a subject was motivated to excel in 
an achievement situation. This need-Achieve- 
ment (n Ach) score related to behavior in a 
way that suggests that qualitative differences 
in thought content of persons who score high 
and low in n Ach represent real differences in 
expectancies concerning competitive situations 


! This article is a portion of a dissertation submitted 
in partial fulfillment of the requirements for the degree 
of Doctor of Philosophy to the faculty of the University 
of Michigan. The writer wishes to express his deep 
gratitude to Drs. John W. Atkinson and Carl R. Brown 
for their patient direction and cooperation 

2 Now at University of Michigan, Flint College 


(1). McClelland (5) speaks of the high scorers 
as showing a positive hope of success. He infers 
a defensive fear of failure from the relative 
absence of achievement imagery and relative 
preponderance of negative categories in the 
stories of the low scorers. 

The present study attempts to investigate 
the interrelationships among these three meas 
ures and to test their relative effectiveness in 
accounting for reactions at another level of 
observation, i.e., physiological activity. All 
three measures give rise to distributions of 
scores described in terms of the amount of fear 
or anxiety that characterizes an individual 
generally (MAS) or specifically in competitive 
achievement situations (TAQ, n Ach). Tradi- 
tionally, increases in skin conductance have 
been found to be correlated with emotional 
states. It is expected, therefore, that groups 
characterized as anxious on the individual 
difference measures (high MAS, high TAQ, 
low n Ach), if validly classified, will show con- 
ductance changes that are more consistent 
with the patterns expected in an emotional 
state than the groups classified as nonanxious 


(low MAS, low TAQ, high n Ach). 


METHOD 
Apparatus 


The data were obtained in a room, 9 K 9 X 10 feet, 
in which was located the complex structure housing 
the essential apparatus. These essentials consisted of 
(a) a ground-glass screen 24 inches in diameter on 
which a %-inch spot of light moved, (b) a device which 
provided for a very irregular pattern of movement of 
this spot, (c) a recording polygraph, (d) a galvanometer, 
(e) a pneumograph, and (/) a red signal light 

The movement of the spot was accomplished by the 
projection of a %-inch spot on the screen by a mirror 
arrangement which was driven by two motor-driven 
cams in a highly irregular manner. A hand lever was 
connected in opposition to this device. A larger circle 
(3 inches in diameter) was also projected on the screen 

The recording galvanometer was used in conjunction 
with an electric network which automatically kept 
constant the current passing through the skin of S at 
40 microamperes and measured the apparent resistance 
in terms of the potential difference across the electrodes 
A continuous record of the changes in resistance of 
the skin was maintained by connecting the galvanom 
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stylus on the polygraph. One electrode was 


» the palm of the left hand, the other to the 


eter to a 
attached 
Each electrode 
filled 


olution in which a loose coil of 


underside of the wrist on the same hand 
was made of a fritted glass hiter of fine porosity 


} 


with physiological salt 


anodized silver wire was emersed 
The 
region ot 
A red 


when S had allowed an error to occur 


pneumograph was strapped to the abdominal 
S$ and connected to the polygraph 

signal light, located above the screen, indicated 

Phe 


errors were 


also indicated on the polygraph 


Subj ec ls 


students enrolled in 
of Michigan. Their 


grad and 


volunteer 
the | 


fron 


were 25 


The 5S 


variou 


mait 


hools in niversity 


ranged 
18 to 25 


cla tatu freshman to sate, 


their age from 


General Plan 


rhe general plan was to establish a testlike situation 


in which S was asked to keep the moving spot of light 


] 


within a smal! area by manipulating the hand lever 


introduction, the apparatus was described as 
the Air Force to test intellec 
The following test instructions 


In a brie 
being like devices used by 
tual and motor abiliti 
were given 

I an going to give 


lems that I 


ou a series of reasoning prob 


want you to solve as best you can. I am 


not going to ask you for a solution; I do not care 


whether or not you can verbalize it. But I want you 


to show me you know the answer by 


Kach 
of this spot of light over the screen. I 


your responses 


woblem consists of a sequence of movements 


will start the 
¢ which controls the movements of the spot 
ir task is to discover what the sequence of 
Now the lever 


the 


movernents is that the light follows 


right works in opposition to 


your 
If the 
1 movement by moving the k 
in the 


stick on 


machine moves the spot up you can 


ver so that 


oves opposite direction Therefore, 


ou are able to solve the problem correctly, you 


‘ be able to anticipate each subsequent movement 


of the light, and by manipulating the lever correctly, 


you can keep the spot from moving very tar In the 
center of the screen you can see a ring outlined. Your 
task | 
of the limit 
the spot to leave that area. Your task is to make a 
The 


ny tape here and after the 


to keep the spot of light from wandering out 


of this ring. It is an error if you allow 


good score by making as few errors as you can 
errors will be recorded or 


ession we will compare your score to that of others 


who have been through this test. You are to try your 
best to make as few errors as possibic 


Now 


cues to ax 


n this problem you will have se veral other 


l you be there to 


First, the ring will always 
indicate where the safe-area ends and the error-area 
time you make an error this 
It not only 


in error but will also aid 


Second, ever) 
the 


will indicate that you are 


begir 
red light above screen will flash on 
you in keeping a running count as to how well you 
going to 
begin the sequence and I want you just to observe 
what the problem is 


are doing. Before you begin the test, I am 


it for awhile so you can set 
like and try to figure out the pattern of movements 
I will tell you when I will begin the sequence and 
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when you are to begin avoiding making errors. Do 

you have any questions? 

Continuous records were taken of skin conductance 
and respiration during the entire experimental session.’ 
The session consisted of the following four periods: 
Null 1 Period—a five-minute period in which S actively 
manipulated the lever but was not yet given the test 
instructions. Anticipation Period—a two-minute period 
which immediately followed the test instruction. The 
S watched the movements of the spot but did not yet 
begin to avoid making errors. Solution Period—a ten 
minute period in which S actively attempted to make 
as few errors as he could by controlling the movements 
of the spot. Null 2 Period—a five-minute period after 
the Solution Period which was similar to the Null 1 
Period. No problem was presented while the S simply 
manipulated the lever. The first and last periods were 
conceived of as relatively free of anxiety-arousing cues 
It was not expected that the subgroups differentiated 
by the various dispositional measures would be different 
in their physiological reactions in these periods 

Since the spot actually moved in an irregular manner 
and no solution was possible, errors are not a meaningful 
variable in this experiment. An attempt was made to 
keep the frequency of errors as constant as possible for 
each S by manipulating the amplitude of the spot’s 
movement in relation to S’s actual performance 


Measures of Individual Difference 


Following the motor task, Ss the n-Ach 
stories while still connected to the recording equipment 
They were disengaged before taking the other two 
tests. To be consistent with the literature on these 
measures, each distribution of scores was broken into 
high, middle, and low thirds for comparison on the 
physiological indices. The n-Ach measure consisted of 
four TAT-like pictures, presented to elicit imaginative 
responses from which a score was determined according 
to the coding procedure McClelland, 
et al. (6). The experimenter and another trained coder 
scored all the stories. The reliability coefficient between 


wrote 


described by 


the two sets of scores was .82. Differences were resolved 
by reference to a third expert.‘ The scores for the high-n 
Ach third ranged from 8-14; for the low third, —3-2 

The Test Anxiety Questionnaire has been fully 
described by Mandler and Sarason (7). A high score 
indicates a tendency to reflect feelings of anxiety in a 
testing situation. The scores for the high-TAQ third 
ranged from 23-31; for the low-TAQ third, 1-13 

The Manifest Anxiety Scale has been described by 
Taylor (14). The scores for the high-MAS third ranged 
from 15-22; for the low third, 1-7 


Skin Conductance Measure 


The skin resistance records were analyzed in the 
following way: Samples from the records were taken 

*No significant relationships were found between 
any of the other variables in these experiments and 
any of the characteristics of respiration activity (ampli 
tude Hence no respiratory data will be 
reported 

* The writer is indebted to Mr. Berne Jacobs and 
Dr Joseph Veroff for their cooperation in this scoring 


rate, volume) 
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at the beginning of each period and at the end of each 
subsequent minute. These readings were converted into 
units of and an average 
based on these samples was computed to represent the 
entire period. Analyses were done on the average level 
in each period in terms of variation in conductance 
within each period expressed as a percentage change 
over the Null 1 Period average. The percentages were 
calculated as follows: ' 


conductance (micromho 


Mean Conductance in period 
- . x 100 
Mean Conductance in Null 1 


These percentages were converted into angles by the 


arc sin ‘\/ percentage transformation to insure the 


independence of means and variances (10 


Statistical Analysts 


The estimates 
differences derived 
designated Type III by Lindquist (4). Since there were 
twenty-five Ss in this study, it was impossible to have 
equai numbers in each of the three subgroups for each 
measure. The numbers in each subgroup, therefdre, 
were equalized by randomly eliminating one case from 
the subgroup that contained the unequal number 


error used to evaluate the mean 


were from analyses of variance 


RESULTS 


Interrelationships Among Measures of Individ- 
ual Differences 


The product-moment correlations among 
the three measures of individual differences are 
given in Table 1. The two measures of rea 

tions specific to achievement (n Ach, TAQ) 
were significantly correlated, as were the two 
reactions 


This 


measures concerned with negative 


towards general situations (MAS, TAQ) 


pattern of relationships is consistent with the 


specifications set for the measures by the 
theories to which they are related. The TAQ 
was developed to measure anxiety symptoms 
specific to an achievement situation. The MAS 
was developed to measure general feelings of 
anxiety which may or may not occur in an 
achievement A significant positive 
correlation is expected between them. The 
n-Ach measure is thought to reflect the degree 
to which S has expectancies of goal attainment 
in an achievement situation; such expectan- 


situation 


cies are incompatible with any anxiety reac- 
tions the A high 
negative correlation is expected with TAQ and 


associated with setting 


a low negative correlation with MAS 


Absolute Conductance Level in Null 1 Period 


The initial Null 1 


relation to the several measures of individual 


levels of conductance in 


IN AN ACHIEVEMENT SITUATION 


TABLE 1 


Propuct-MOMENT CORRELATIONS AMONG 
THREE MEASURES OF INDIVIDUAI 
DIFFERENCES 


Manifest 
Anxiety 


Test 
Measure 
Anxiety 


n Achievement 43* 24t 


rest Anxiety 
*+'s of .39 and .50 are significant at .05 and .01 levels of con 
fidence respectively 

t One subject failed to fill out the Test Anxiety Questionnaire 


correctly and was eliminated from all Test-Anxiety analyses 


rABLE 2 


CONDUCTANCE 
PERIOD 


MEAN ABSOLUTE in Nu 1 


Measure 


n Achievement 
lest Anxiety Questionnaire 
Manifest Anxiety Scale 


* Units in micromhos 


rABLE 3 


MEAN VARIATION IN CONDUCTANCE EXPRESSED 
AS PERCENTAGE OF Now 1 Level 
N = 8 for all subgroups) 


Period 


Measure Subgrouy 


Antici olu- | Null 
pation | tion 2 


High 102.5*| 95.4168 
Middle |101.1 |106.5/85 
Low 100.1 |110.0)79.¢ 
High 104.2 |109.2/79 
Middle |101.0 |103.1/86 
Low 103.8 | 89.5166 
High {106.0 |109.6)82 
Middle | 94.0 | 98.8/71 
Low 103.2 |101.8)78 


n Achievement 


lest Anxiety Question 
naire 


Manifest Anxiety Scale 


* The o diff. for each measure is as follows: n Ach: between = 
TAQ - 16.3, within = 64; MAS 


within = 7.6 


16.4, within = 7.4; between 


between = 16.5, 


differences are presented in Table 2. In this 
period, the anxiety-arousing instructions had 
not yet been introduced. However, on each of 
the three measures, the subgroup which is pre- 
sumed to be composed of relatively nonanxious 
Ss showed the highest conductance level. Al- 
though even the largest difference, that ob- 
tained between the subgroups scoring high and 
low on n Ach, was not significant (o diff. = 6.42, 
l 997), the consistency of the trends in each 
of the comparisons may be worth noting. They 
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TABLE 4 
MeAN VARIATION tn ConpucTANCE EXPRESSED 
AS A PERCENTAGE CHANGE OF No Lt |! 
LEVEL ror SuByects DEesIGNATED 
Anxious OR Nonanxiovus* 
Period 
} 
subgrouy N ' 
Antici rN 9 
Solution Null 2 
pation 
Anxiou ] 104.4 112.1 83.7 
Nonanxious 9 101.7 89.0 67.6 
Difference 2.7 23.1 16.1 
o diff 12.42 12.22 9.05 
t 22 1.89 1.78 
p (one-tailed 04 05 
*On the basis of scores on both the Test Anxiety and the r 


achievement measures 


are consistent with the notion that a high con 
ductance level may be indicative of psycho 
logical states other than anxiety 


Skin Conductance 
Periods 


The 


pressed as a percentage change over Null 1 


Variation in Experimental 


mean variation in conductance, ex- 
level in three experimental! periods, is presented 
in Table 3 for each of the subgroups differen- 
tiated by the three measures. The subgroups 
differentiated by measures of n Ach and TAQ 
showed significantly different conductance pat- 
terns. Though none of the between-subgroup 
differences were significant at any particular 
period, there were meaningful differences be- 
tween subgroups in the direction of the trend 
of skin the different 
experimental periods. The subgroups consid- 
ered relatively Ach, high 
TAQ) showed an increase in conductance from 
the Anticipation to the Solution Period. The 
relatively nonanxious 


conductance through 


“anxious” (low n 


subgroups considered 
(high n Ach, low TAQ) showed a decrease. The 
differences between these particular differences 


with both measures were significant in the 
predicted direction (n Ach: diff 17.0, o diff. 

10.5, ¢ = 1.62, p = 05; TAQ: diff. = 19.3, 
o diff 9.1, ¢ 2.12, p 025). 


The subgroup designated as anxious by the 
more general MAS did not show any pattern 
of change in conductance that was significantly 
differen 
nonanxious 

A final analysis was made, comparing only 
those Ss scoring both high in TAQ and low in 
n Ach (both treated in the past as symptomatic 


from the subgroup designated as 
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of a disposition to be anxious in achievement 
situations) with Ss who scored both low in 
TAQ and high in n Ach (both treated in the 
past as symptomatic of the relative lack of a 
disposition to be anxious in an achievement 
situation). The significant negative correlation 
between the two measures implied that by 
examining S’s score on both measures, it would 
be possible to obtain a more reliable estimate 
of the degree to which he might be anxious or 
nonanxious in a competitive situation. On each 
measure, the distribution was divided as near 
as possible to the median. Only Ss who fell in 
the corresponding high and low anxiety halves 
of both distributions were selected for com- 
parison. Mean variations in conductance for 
these subgroups are presented in Table 4. It 
can be seen that by this method of classifying 
Ss, the trends observed between subgroups in 
the separate analyses were further emphasized 
and were significant in the Solution and Null 2 
Periods. The directional change between the 
Anticipation and Solution Periods observed, 
significantly different in the separate analyses, 
was also significant here (diff. = 20.4, o diff. = 
10.1, 4 = 2.01, p = .03). 

The trends observed in the earlier analyses 
concerning the initial level of conductance in 
Null 1 were further emphasized in this method 
of classification. The subgroup designated 
nonanxious showed a significantly higher initial 
level of conductance than the subgroup desig- 
nated anxious (diff. = 7.2, o diff. = 2.88,1 = 


2.50, p = .02). 
DISCUSSION 


Skin conductance, having been traditionally 
considered a physiological index of anxiety, is 
here viewed as a criterion against which dis- 
criminations made by the three dispositional 
indices of anxiety may be checked. The findings 
imply that the more specific an assessment 
technique is to the situation in which the pre- 
dicted behavior is to occur, the more accurate 
is the prediction. This point is emphasized by 
the results obtained when Ss were classified 
according to their standing on both the n-Ach 
and the TAQ measures. By this method, the 
nonsignificant between-subgroup trends ob- 
served in the individual analyses were further 
emphasized, reaching a significant level in two 
of the three experimental! periods. Further, the 
difference in the amount of change between 
the two subgroups from the Anticipation to 
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the Solution Periods was accentuated as pre- 
dicted. 

An important question is raised by the 
present result: Of what importance are meas- 
ures which are designated to be “general indi- 
cators” of anxiety? Behavior always occurs in 
specific situations. If an individual is termed 
“generally anxious,” does it imply that he is 
anxious in all specific kinds of situations or 
just in a greater than average number? These 
results suggest that particular anxiety reac- 
tions are associated with specific situations. An 
assessment technique constructed to reflect 
response tendencies peculiar to a certain lim- 
ited class of situations will probably have a 
predictive advantage over more generally 
oriented methods. 

The analyses made in terms of the initial 
level of conductance in the Null 1 Period 
showed that in each of the individual difference 
measures, the subgroup considered to be the 
least anxious generally had the highest level of 
absolute conductance. Though these differ- 
ences between subgroups in the individual 
analyses were not significant, they did reach 
statistical significance when Ss were classified 
on the combined n-Ach-TAQ indices—pre- 
sumably the most reliable estimate of indi- 
vidual differences in disposition for achieve- 
ment test anxiety. Since the Null 1 Period 
occurred before the experimental instructions 
were given, the differences must be due to 
initial reactions to the experimental situation 
itself, unless differences in conductance are a 
stable correlate of the individual difference 
measures. 

Schlosberg (9) has argued that conductance 
levels should be considered an indicator of the 
degree of energy mobilization or “activation” 
of the individual in a particular situation 
Activation has no particular “sign” or direc- 
tion; it may stand either for fusitive interest 
in the task or for fear motivating avoidance. 
The Ss with a higher conductance level on 
entering the experimental room may be react- 
ing to the situation with positive anticipatory 
interest cued off by the complex apparatus that 
immediately faces them. This interpretation is 
consistent with the contention of McClelland 
et al. (6) that the high n-Ach subgroup is likely 
to be challenged by a problem situation. 
Burdick (2) similarly reporied a significantly 
greater tendency for persons high in n Ach to 
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volunteer to participate in psychological ex- 
periments. 

If Schlosberg’s theory of activation is cor- 
rect, it may mean that the increased conduct- 
ance level that the high n-Ach subgroup shows 
initially has quite a different “sign” than the 
change the low n-Ach subgroup shows later. 
The conductance measure itself is not capable 
of giving the direction of behavior implied by 
increased activation, but it is possible to infer 
direction from the way the two subgroups are 
conceptualized according to n-Ach theory. It 
is possible, therefore, to look at the conduct 
ance measure as an indicator of the degree to 
which an individual is activated by the cues 
of the situation. The direction of this activa- 
tion, either positive (approach) or negative 
(avoidance), can be inferred through an analy- 
sis of the content of human thought (6, p. 33). 
Neither the MAS nor TAQ allow this possi- 
bility since there is no direct way of obtaining 
evidence of positive orientation from them 


SUMMARY 


This study investigated relationships among 
three dispositional measures and two physio 
logical indices of anxiety in a competitive 
achievement situation. The dispositional meas- 
ures were relatively weak motivation to achieve 
as measured through content analysis of im- 
aginative stories (low need Achievement), the 
Mandler-Sarason Test Anxiety Questionnaire, 
and the Taylor Manifest Anxiety Scale. The 
physiological indices were skin conductance 
and respiratory volume. 

Twenty-five male 
placed in a competitive test situation and in 
structed to perform a complex perceptual! 
motor task. They were informed when they 


college students were 


made errors and were given cues to aid in 
performing the task accurately. The physio 
logical indices were recorded before the ego- 
involving instructions were given and immedi- 
ately before, during, and after the performance 
task. The three dispositional measures were 
obtained after the performance task. 

The correlation between need Achievement 
and Test Anxiety was —.43; between Test 
Anxiety and Manifest Anxiety, .53; between 
Achievement and Manifest Anxiety, —.25. 
The two measures specifically concerned with 
reactions in competitive achievement situa- 
tions (n Achievement, Test Anxiety), were 
both related to changes in skin conductance 
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during the performance task. The relationship 
was clearest when the subjects were classified 
on both of these measures as anxious (i.e., high 
Test Anxiety and low n Achievement) or 
nonanxious (low Test Anxiety and high n 
Achievement). The more anxious group in- 
n conductance while the relatively 
group The Manifest 


Anxiety Scale did not relate to conductance 


creased 
nonanxious decreased 
change and no consistent relationships were 
found between respiratory activity and any of 
the other measures. These results suggest the 
advantage of situational over general measures 
of anxiety in accounting for changes in skin 
conductance in a parti ular situation. 

It was also found that the highest initial 
level of conductance was obtained by the sub- 
group designated as nonanxious on each of the 
three dispositional measures respectively. 
These differences occurred before the introduc- 
tion of the anxiety-arousing instructions. These 
findings are interpreted in terms of Schlos- 


berg’s theory of activation level 
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CONDITIONING OF A RESPONSE CLASS ON A PERSONALITY TEST 
ANNE M. NUTHMANN' 


University of Wisconsin 


HE demonstration of verbal learning 

without awareness of what is being 

learned has been the subject of much 
controversy in psychological literature for 
many years. Recent studies (7, 8, 15) using 
a word-association technique report conflict- 
ing results, but positive findings (7, 8) are of 
very small absolute magnitude. Studies using 
other methods (1, 2, 3, 4, 12, 14) that might 
be expected to involve less interference from 
the subject’s (S’s) previous experience have 
found more striking evidence for 
without awareness. In some (1, 2, 3, 14), ver- 
bal stimuli contingent upon S’s response have 
been found to have reinforcement properties, 
but when mechanical stimuli such as a light or 


learning 


a tone are used, reinforcement sometimes does 
(4, 12) and sometimes does not (1, 14) occur. 

If learning can occur without awareness in 
areas relevant to personality formation when 
contingent stimuli are presented in the rela- 
tively simple fashion of these studies, the im- 
plications for psychotherapeutic methodology 
are great. A personality variable that has been 
given much attention in recent literature on 
“attitudes toward 
Changes from more negative to more 


psychotherapy is that of 
self.” 
positive statements about the self have been 
shown to occur in successful therapy (9, 10, 
11). Also scores on several scales of the MMPI 
have been shown to change significantly after 
therapy (5). There has been much less success 
in correlating therapist behavior with thera- 
peutic outcome (13), The present study is an 
attempt 
sponses indicative of acceptance of self on a 


to demonstrate an increase in re- 


personality test as a function of the contin 
gent stimuli “good” and a light blink 


METHOD 


Selection of items for the personality test. Several 


bundred items, Many from existing personality and 
interest inventories, the remainder devised by the 
author, were classified by 80 students of elementary 


psychology courses at the University of Wisconsin into 
three categories: acceptance of sell, rejection of sell, 
and items having nothing to do with acceptance or 
rejection of self. Acceptance and rejection of self were 


1 Now Great Falls 


Montana 


at Mental Hygiene Clinic, 
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defined for the students as follows: “One accepts himself 
by thinking he is an adequate person, admitting that 
he has strong points and weaknesses. One rejects himself 
by thinking he is an inadequate person or by refusing 
to admit weaknesses which are true of almost every 
body.” These definitions were written at the top of the 
list of items which the students were required to 
classify, but no example items of the three categories 
were given. An item was included in the experimental 
test as indicating acceptance or rejection of self if the 
difference between the proportion of students placing it 
in one of these categories and the proportion expected 
by chance was significant at the .001 level. To meet this 
criterion, it was necessary for 57 out of the 80 students 
to place it in the same category. The two categories, 
and “rejection,” were then 
since an answer of “false’’ to a rejection item indicates 


“acceptance” combined 


acceptance. To reduce the probability of the experi 
mental Ss becoming aware of the conditioning process, 
buffer items from the third category were added 

rhe resulting test was administered to a new group 
of 80 the of 
acceptance-ol-self answers to each item and to eliminate 


students to determine relative number 
items that were not answered in the opposite direction 
from the majority by at least 7 of the 80 students, since 
rhe 


indicating 
final test 


these items would be unlikely to be conditioned 
number of 
acceptance of self were then made equal 


“true” and “false” answers 
The 
consisted of 100 ac« eptance-ol-sell items and 40 buffer 
items 

Selection of subjects 


to still another group of 420 students 


Ihe test was then administered 
The Ss for the 
experiment were chosen from students scoring in the 
lowest 15 per cent of the acceptance-of-self distribution 
They were matched on the basis of these scores and 
assigned of the 
“good,” light, or control 


to one three experimental groups, 


15 Ss in each 
group. Between one and two months after this pretest 


There were 


the experiment was conducted 

Apparatus. In the smail room used for the experi 
ment, the principal piece of apparatus was a 22- « 24 
inch fiberboard partition painted flat black and mounted 
vertically on a table so that it separated S and EZ. A 
2\4- & 4%-inch opening in the center of the partition 
x S-inch white 
containing the items. Immediately above the opening 


served as a display for 3 index cards 
was a hole 1 inch in diameter covered with white paper, 
which ceuld be illuminated by a 6-v. bulb on 2’s side 
of the partition to produce an evenly diffused signal 
light to S 
and | 


were placed 2 


Lever-type microswitches labeled T (true 


false) by means of white paint on the partition 
inches above the base of the partition and 
8 inches in from either side. The switches were wired 
to 6-v. bulbs on 2’s side of the partition, and signaled E 
which microswitch had been depressed. The light signal 
ing 5 was controlled by a silent switch that could be set 
by £ to go on when the T or F switch was depressed, 
or not to go on for either. The items, typed on the 
index cards, 


containing 10 items for which ‘ 


were divided into five blocks, each block 


true” was the accept 
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ance-of-self response, 10 items for which “false” was 
the acceptance-of-self response, and 8 buffer items. The 


blocks were matched for the number of acceptance-of 
self answers to the items within them given by the 
group of 8) students to whom the test was first admin 
istered as a personality test 
of the blocks was varied according to a 5 X § latin 
square to control further for possible unequal condi 
tionability of the blocks 

Conditioning 
seated at the table facing the fiberboard partition and 
was read the following instructions 


The order of presentation 


In the experimental session S was 
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“Several weeks ago in your Psychology 1 class you 
took a personality test. I am interested in the effect that 
different ways of presenting personality statements has 
on the way people reply to them. You will now be 
given the same test you had before only in somewhat 
different form. Through this window you will be shown 
a series of cards on which are typed the personality 
statements. You are to read each statement carefully 
and decide whether or not it is true or mostly true with 
respect to yourself as you feel right at this moment. 
Then press the appropriate true or false switch in front 
of you. Hold the switch down for about one second, 
like this (2 demonstrates), but do not press a switch 
unless a card is showing in the window. Try not to take 
too long to decide on your answers and don’t worry about 
being consistent. It is a very common occurrence for 
people to change their answers on personality tests 
when they take them a second time. I am interested in 
your first impressions right now. I am not keeping a 
record of your name as I am interested only in group 
results, so please be as honest as you can in your 
responses. Are there any questions?” 

The £ then began to present the cards, the rate of 
presentation being determined by S’s response rate 
For the two groups to be conditioned, whenever S 
gave an acceptance-of-self response it was followed 
immediately by the appropriate contingent stimulus 
For ail groups E then removed the card, recorded the 
acceptance-of-self response, and presented the next 
card. For the light group £ set the light-control switch 
immediately before presenting each card 

After all 140 cards had been presented, Ss in the 
light group were asked the following question, ‘““What 
did you think the purpose of the light was?” The Ss in 
the “good”’ group were asked, “Did you think that any 
thing I said had anything to do with your responses?” 
On the basis of Ss’ answers to these questions £ deter 
mined awareness of the conditioning process 


RESULTS 

The principal findings of the experiment are 
shown in Fig. 1, in which the mean number of 
acceptance-of-self responses for the five suc- 
cessive blocks of items is plotted for each of 
the groups. The distributions within the vari- 
ous subgroups’ were moderately skewed, and 
sartlett’s 
the Ss 
within groups, a chi square of 7.52 was ob- 
tained. For 2 df this is significant between the 


when the data were subjected to 
test for homogeneity of variance of 


.O5 and .02 levels, indicating moderate hetero- 
geneity of variance. Lindquist (6) suggests 
that nonnormality and heterogeneity of vari- 
ance do not have an appreciable effect on the 
validity of the / ratio unless these deviations 
are marked. Alexander trend 
analysis was carried out on the data of the 


Therefore an 


three groups after a Bartlett’s test of the slope 
terms failed to show heterogeneity of variance 
(chi square of 3.29 for 2 df). The results of the 
trend analysis are summarized in Table 1. The 
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F of 3.52 for Between Group Means indicates 
that the means of the three groups differed to 
an extent significantly greater than could be 
accounted for by individual differences. The 
F of 4.03 for Between Group Slopes was also 
significant beyond the .05 level. Inspection of 
Fig. 1 indicates that both of these differences 
are almost entirely due to the progressive in- 
crease in acceptance-of-self responses for the 
group receiving the contingent stimulus 
“good.”” The curves for the light and the 
control groups are very similar over all five 
blocks, indicating that the light was not a re- 
inforcing stimulus. 

With respect to the performance level of the 
three groups at the end of Block 1, it is inter- 
esting to note that although the groups were 
matched on the basis of performance on the 
pretest before the beginning of the experiment 
it was not possible to match them exactly. 
The average score for Ss in the control group 
on the first test was slightly higher than that 
of the other two groups. The “good” group 
overcomes this handicap in the first block of 
items, suggesting that conditioning takes place 
from the beginning. However, though it is evi 
dent that the “good” group as a whole in- 
creased in acceptance-of-self responses over 
the five blocks, not all of the Ss in this group 
did so. 

A latin-square analysis of the data was also 
carried out to determine whether the individ- 
ual blocks differed significantly from each 
other. The F of 5.98 for blocks was significant 
at the .001 level, and indicates that certain 
blocks of items were easier to condition than 
others, in spite of the fact that the items were 
originally assigned to the blocks on the basis 
of equal popularity. 

In the “good” group five Ss, and in the 
light group four Ss, became aware of, i.e., 
could verbalize, the purpose of the experi 
ment. It is fairly certain that those who could 
not verbalize the 
were unaware, because for the most part these 
Ss verbalized a contradictory hypothesis, 
many of them stating that reinforcement ox 
curred whenever their response was consistent 
with that of previous testing. Since no over-all 
increase occurred for the light group, only the 
results of the “good” group were analyzed fur- 
ther. An Alexander trend analysis of the data 
for the aware and unaware Ss of the “good”’ 
group showed that neither the difference be- 


correct purpose actually 
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tween the group means nor the difference be 
tween the group slopes was significant, the F 
ratio for the slope difference being less than 
1.00. The aware group, however, did respond 
at a slightly higher level than the unaware 
group at all points. 

Since the Ss for this experiment represented 
one extreme of a distribution, one might ex- 
pect a regression toward the mean to occur 
for all the groups on a second testing. There 
actually was an over-all average increase in 
total acceptance-of-self responses from the 
pretest to the experimental test. In the con- 
trol group this average total increase was 6.74 
responses, in the light group 6.53 responses, 
and in the “good” group 16.86 responses. In 
the “good” group 7 of the Ss increased 20 


responses or more; 11 of them increased 10 


resjx ynses or more. 


DISCUSSION 


The results of the present experiment dem- 
onstrate that it is possible to learn without 
awareness, defined in terms of verbal report, 
to respond in a more self-accepting way to 
items on a personality test. Here, as in other 
experiments on learning without awareness, 
one may ask exactly what it is that the Ss 
learned. In the studies (3, 4, 12) where plural 
words were conditioned, for example, did the 
Ss learn the concept plurals or did they learn 
to say words ending with the s sound? In the 
present case it is obvious that the Ss did not 
learn to respond in a certain way to specific 
items, because each item occurred only once. 
They did not learn a “true-false” pattern of 
responding because the items to be conditioned 
did not appear in such a systematic order 
Also a given order of presentation of the 
blocks cannot account for the group increase 
in acceptance-of-self responses since this order 
of presentation was varied in a latin square 
design. Another possibility arises as to the 
effect of differential wording of the items to 
which “true” and “false” were the acceptance- 
of-self responses. Very few of the rejection-of- 
self items were worded negatively, so that the 
Ss could not learn to respond “‘false’”’ to nega- 
tively worded items. It is true, however, that 
most of the times for which “false’’ was the 
acceptance-of-self response did contain words 
that might be classified as being negative in 
emotional the ‘true 
ance-of-self items in general contained words 


tone, whereas ” accept 
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Whether emotional 
tone or the broader concept of acceptance of 


of positive emotional tone 


self was actually discriminated remains open 
to question. In favor of the latter alternative 
is the consideration that if discrimination of 


] 


emotional! tone did occur, some of the Ss should 


have been able to formulate it verbally, since 


some of them were able to verbalize the more 


abstract concept of acceptance of self. For the 


purpose of this discussion, therefore, it is as 


sumed that discrimination was in terms of self 
acceptance 


The 
be capable of bringing about this increase in 


verbal stimulus “good” was shown to 


acceptance-of-self responses. There were, how 
ever, large individual differences in the amount 
of conditioning. Two factors may be suggested 
to account for these Liffe rences. As a result of 


1 differential degree of past learning by the 


Ss of responses involving their self concept, it 
may take longer for reinforcing stimuli to 
bring about a measurable change in some Ss’ 
bel or than in the behavior of others. That 
| . ho have re ponded, overtly or covertly, 
I ar orthle hundreds of times may take 
lone to change th response than Ss who 
have t re ponded only a few times. Also 
the stimulus ‘“‘good’’ may have different rein- 
for ilue for different Ss as a result of 
their previous experiences with it 
The behavior of Ss in the group who were 
iware of the &’s intent, it has been noted, did 
no ffer significantly from the behavior of 
those ho were unaware. Four of the five 
iware’”’ Ss volunteered the additional infor 
mation that they were quite sure that they 
had not been influenced by the reinforcement 
The results obtained with the verbal stimu 
lus od”’ may have important implications 


for psychotherapy. It is possible that some of 


the changes in a patient’s verbalized self con 


cept that occur in therapy are the result of 


simple verbal reinforcement given, perhaps 


sometimes unknowingly, by the therapist 
he light blink, however 


reinforcing stimulus in this 


clearly was not a 
ituation, not even 
for the four Ss who were aware of the purpose 
light. This 


previously 


offers evidence to 
j 


noveste 
iggested 


of the finding 


support a hypothesis 
(14) that a situation of the 


conducive to conditioning by means of a light 


clinical sort is not 


Perhaps in order to function as a reinforcer of 
verbal behavior a stimulus must in some way 
connection 


be familiar to the S as having a 
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with his responses. In the studies that found 
the light to be reinforcing (4, 12) Ss were not 
responding with meaningful verbal material, 
but were simply saying words. Presumably 
the Ss had never experienced this type of 
situation previously, and therefore a blinking 
light would not be a strange resultant of their 
behavior. On the other hand, in the studies 
obtaining negative results with a light (1, 14) 
and in the present study, the Ss certainly had 
much previous experience with using mean- 
ingful sentences but most probably had no 
experience with being confronted with a light 
blink as a conveyor of feedback 


SUMMARY 


The purpose of the present experiment was 
to the value of the 
verbal stimulus “good”’ and a light blink stim- 


ascertain reinforcement 
ulus on a class of behavior designated as ac- 
ceptance-of-self responses on a true-false per- 
sonality test. Each item in this test that was to 
the basis of 
agreement among student judges significantly 
different from chance at the .001 level that the 


be conditioned was chosen on 


item measured acceptance of self. The test was 
then administered to 420 psychology students, 
and Ss for the experiment were chosen from a 
subpopulation receiving low acceptance-of-self 
scores 

Che Ss were assigned to one of two experi- 
mental groups or a control group after having 
been matched on the basis of their perform- 
ance on the pretest. In the experimental situa- 
tion the items were presented to the Ss indi- 
vidually on index cards. For Ss in the experi- 
true or false answer 


mentat each 


indicating acceptance of self was reinforced by 


groups, 
E’s saying “good” (in one group) or by an 
immediate light blink (in the other.) In the 
control group no feedback of any kind was 


The S’s awareness of 


given the conditioning 
procedure was determined from his verbal re 
port at the end of the experimental session. 
The results indicate that (a) learning with 
out awareness to respond positively to accept- 
(b) the 


behavior of Ss who are aware of the purpose 


ance-of-self statements can occur; 
of reinforcement does not differ significantly 
in this situation from that of Ss who are not 
aware; (c) the verbal stimulus “good” is cap- 
able of bringing about such learning whereas 


the nonverbal light stimulus is not. 
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THE GENERALITY OF 


ATTITUDES TOWARD AUTHORITY 


AND NONAUTHORITY FIGURES':? 
LEROY S. BURWEN 
Research Division, Chicago Tribune 
anp DONALD T. CAMPBELL 
Northwestern University 


HE collection of men into large-scale 
coordinated organizations typically in 
volves a hierarchical order of authority 

in which the great bulk of persons are placed 

between a superior and a subordinate echelon 

‘T his 

tance to the description and diagnosis of any 

if 


one 


ommon situation gives general impor 


generalized attituce toward authority, 


indeed such exists. Furthermore this is 
predictive setting in which all available psy 
to Writers 
varied orientations as Freud, Piaget, 
News omb,' Krech Crut hfield, 


gree 


choloyical theories seem 


rt agree 


with as 
Roge rs and 


etc., a in positing attitudes towards 


parental authority originating in the family 
situation which generalize and manifest them 
later 


selves as toward 
Piaget, for example, writes 


attitudes superiors in 


socia ituations 


] 
AS follows 
Day to day observation and psychoanalytic experience 


that the 
generalized and applied to many people 


show first personal schemes are afterward 
According as 
the first inter-individual experiences of the child who is 
just learning to speak are connected with a father who 
is understanding or dominating, loving or cruel, 


the child will tend (even throughout life if these rela 


etc., 


tionships have influenced his whole youth) t 
late all other 


assimi 
individuals to this father scheme (7, p 


207 
rhere is also a widespread awareness among 
clinical psychologists that problems of sibling 


rivalry repeat themselves in peer relation 


ships in adult life. Whether these attitudes or 


response yeneralizations are mediated by 


stimulu milarity in terms o! age and physi 
cal appearance; by language habits in which 
the terms father, old man, and boss are used 


| 


interchangeably; or by learned stimulus equiv 


' This study was sponsored in part by the United 


United States Air Force under contract No. AF 18 
(600)-170 monitored by the Crew Research Laboratory, 
Air Force Personnel and Training Research Center, 
Randolph Air Force Base, Randolph Field, Texas 

* This arficle is in part based upon a dissertation sub 
mitted the senior author to the University of 
Chicago in partial fulfillment of the requirements for 


the degree of Doctor of Philosophy 
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alence, role similarity, or other nonspatial con- 
figurational attributes, the predictions remain 
the same. The present study attempts to put 
such predictions to test. 

In terms of methodological orientation, this 
study attempts to measure a smal! number of 
attitude dimensions through a large number of 
The 
more common approach in which a large num- 
ber of traits are measured by a single method 
leaves the investigator unable to distinguish 
relation- 


independent measurement techniques. 


meaningful psychological 
ships and such irrelevant sources of common 


‘re. 


between 


variance as ‘“‘apparatus factors” (9) and 
sponse sets” (3, 4). The approach here taken 
congruent with the of “construct 
validity” (8). All the measures 
are accepted as subject to both random and 


is notion 


introduced 


systematic error. None is set up above the 
Validation becomes a 
among 4a 


others as a “‘criterion.”’ 
process of mutual confirmation 
series of measures all directed toward the same 
psychological dimension, but maximally in- 
dependent in terms of apparatus, conditions 
of administration, and the like. 


METHOD 


rhe data were collected during January and Febru 
ary of 1953 from 155 men assigned to 17 bomber crews 
at Randolph Air Force Base. The men were in crew 
assembly training in preparation for action in Korea 
and had been together for approxiriately two weeks 
The 73 officers were typically men in their late 20’s and 
early 30’s with World War II experience. The 82 
enlisted men were largely in the 19-23 age range in 
their first tour of military duty. Several crews were 
assembled at the same time for the group testing. The 
two hours of testing represented here was only a 
portion of the research testing to which they were 
subjected. In the testing there was emphasis upon the 
fact that the data were for research purposes and would 
not become a part of any man’s record. The practical 
anonymity and mutual support of the group situation, 
plus general characteristic of the 
impending transfer to Korea, resulted in a testing 
situation characterized in some cases by perfunctory 


more attitudes 


compliance and occasional humorous sabotage of the 
test purpose. In the end, between 10 and 15 per cent of 


the answer sheets had to be discarded. In addition to 
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the group tests, 57 interviews were conducted indi 
vidually on a basis of compulsory assignment. Particu 
larly for the enlisted personnel, this setting created a 
guarded, deferential attitude that was difficult to 
overcome during the one-hour period involved 

Seven different 
employed 
attitudes were sought 
authority (as might be shown, for example, in responses 


measurement approaches were 
these, as 


father 


Through each of appropriate, 


toward: own symbolic 
to the picture of older persons in the TAT); immediate 
boss (the aircraft commander for most crew members 

immediate peers (fellow crew members); and symbolic 
peers. In all, 21 separate variables were scored. Unless 
otherwise specified, the number of cases upon which the 
reliability coefficients are based is 155, except for the 
interview measures, where the N is 57. Officers and 
enlisted men have been pooled in the analyses that 
follow, in order to provide maximum stability for the 
ofhicer-enlisted dif 


ceeficients computed. Significant 


ferences were found on only 3 of the 21 measures 


Procedures 


1. Interview.* The interviews focused upon family 
background and the attitudes toward the current Air 
Force situation. The subject (S) was drawn out on 
attitudes toward his father, siblings, present superior 
officers, and peers. Each of these four attitudinal areas 
was then scored on a five-point scale for favorability or 
unfavorability. These ratings were made on what the 
interviewer inferred the covert attitudes of the S to be, 
rather than simply the overt attitudes expressed. While 
no direct estimate of the reliability of interview ratings 
is available, a minimum estimate might be the squared 
correlation coefficient of the interview variable with 
other test variables (see Table 1) 

2. Thematic Apperception Test. TAT-like 
were administered by projection upon a screen while 
the Ss their 
respondents was: “You are 
picture which will explain what has happened and how 
things are going to turn out.” Eight pictures were 
used, three from the Murray set (1,6BM and 7BM) and 
five specially designed for Air Force use.‘ Cards were 
Symbolic 


pictures 


stories. The assignment to 


to tell a story about this 


wrote out 


selected so as to elicit attitudes toward 

Authority and Symbolic Each of thes 
variables was scored on a five-point rating scale ranging 
from highly unfavorable characterization of the person 
to highly favorable characterization. Older persons and 
persons of higher military rank were assumed to be 


figures while peer-age persons 


Peers attitude 


symbolic of authority 
and persons of lower rank were interpreted as 
The interjudge reliability for 
a correlation coefficient 


symbolic 


peers or subordinates 
these ratings is represented by 
of .83 between one judge and ancther on a sampling of 
92 stories 

Internal-consistency reliability is dificult to estimate 
on a tree-response test where not every person gives a 


scorable Seventy-one Ss 


response to every stimulus 
4 The senior author, with a background of four years 
of clinical work, conducted the interviews and scored 
al] measures involving clinical or subjective judgment 
‘The authors are indebted to Dr. William Soskin 
for two of these pictures; three were designed by the 


senior author 
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TOWARD AUTHORITY 25 
gave some scorable Symbolic Peer descriptions on two 
of the cards. The correlation between the ratings 


received on these two cards was .36, which corrected 
by the Spearman-Brown formula yields a reliability 
of .53 for a upon the two cards 
Scorable responses for Symbolic Peer also were made 
The mean number of scorable Symbolic 


total score based 


to other cards 
Peer responses per S for the entire 155 Ss was 1.8. One 
hundred and eight Ss gave scorable responses to pre 
sumed authority figures on four of the cards. Correlat 
ing the sum of the ratings on two of these cards with 
the sum of the ratings on the other two yields an r of 
55 for the reliability of the sum 

The mean number of scorable 


38 which corrects to 
of the 
Symbolic Authority responses for the total 155 persons 
thus the reliability based upon four cards is 
Phe scoring pro 


four card scores 


was 6.2: 
presumably a slight underestimate 
cedure employed was somewhat more direct than many 
clinical TAT interpretations. In particular, no use was 
made of the presumed ego representation in the story, 
dynamic interpretations 


It was 


nor of more complicated 
(e.g., such as would involve reaction formation 
therefore thought desirable to rerate the stories on a 
more clinical basis, using whatever complex cues might 
be available. It is important to note here, however, that 
the stories contain much less complexity than those 
provided by neurotic persons with middle-class back 
grounds. These global clinica] ratings are designated by 
TATg while those based upon scoring the descriptions 
of specific persons summed over the eight pictures are 
indicated by TATo 

3. Description of Self and Others. In this test the 
respondents were asked to evaluate other persons by 
indicating whether each of 30 different trait terms 
applied or did not apply Eight of the 30 terms were to 
be marked as applying; eight as definitely not applying; 
and the rest omitted with a intermediate 
weighting. The 
superior (boss) and a fellow worker were scored in terms 


resulting 
descriptions of father, immediate 
of the degree to which favorable terms were used in the 
description. The reliabilities* are as follows: father, .24; 
boss, .34; peer, .55 

4. Photo Judging Test 
were asked to characterize 
photographs. It was assumed that the photographs of 
middle-aged and older persons would be most likely 


In this test the respondents 


briefly persons shown in 


to tap attitudes toward those superior in authority 
and that reactions to the photographs of high school 
likely to tap attitudes 
toward subordinates and peers. For the present scoring 
purposes the responses to 20 photographs of men over 
45 form the basis for the score on attitudes toward 
Symbolic Authority. The responses to five photographs 
of high school boys provided the score labeled Sym 


age persons would be most 


* Unless otherwise noted, the reliabilities referred to 
for this and following measures are internal consistency 


measures of the Kuder-Richardson type, with the 


computation using item variances and total score 
variances, following Gulliksen’s Formula 10 (5, p. 223 
If the assumptions underlying this coefficient are not 
met, the reliability is systematically underestimated 
It is worthy of note that although the reliability of 
father description is only .24, it correlates .40 with both 
the interview rating on father and with the father 
score obtained from the autobiographical inventory 
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rABLE 2 
Mean Correcations sy Tor 
I 
I 

k A BK I P 
Fathe I 3 12) .03) .06 .08 
\uthorit A l O08} 10) .06 
Bo K 09 .13 .03 
Peer P 22. .07 
vi oO Peer Pp 01 
bolic Peer. The assignment to the Ss was to “Write 
down what you think the person in the picture is like.” 
Space was left for 2 or 3 sentences, although in general 


Phese 


terization was favorable, 


a listing of trait terms was given responses were 


rated as to whether the chara 


intermediate, or untavorable Interjudge agree ent in 


making these ratings was .95 for one pair of judges and 
other. Internal 
Authority 

tulubiographical Inventory 
tained both 
items. The 


for reporting here 


reliability for 
Symbolic Peer, .29 
This inventory 


items 


BO ior ar consistency 


Symbol was .46, for 
con 
check-list 


short free and 


full detail 


response 


of the questionnaire are too long 


and, as with all instruments discussed 


in this paper, have been reported fully elsewhere (2 


Scoring involved both subjective ratings of [ree-response 


items and direct scoring of check-list material. Inter 
judge agreement on a imple of 100° subjectively 
evaluated items was .46. There were eight items on the 


yielded an 
Attitudes toward 


S’s relationship to his father and these 


internal consistency reliability of .56 


bo this instrument included present boss and pre 
vious supervisors and teachers. There were 16 such 
items which yielded a reliability of .56. There were 11 


items describing relationships with peers, including 


classimatt childhood friend and fellow workers, 


giving a@ reliability of .55 


6. Altitude Survey. The testec personnel were also 
given an attitude survey developed for the study of 
morale by the Crew Training Research Laboratory, 
Randolph Air Force Base. Ten of the items of this test, 
all of which were statements to be endorsed in one of 
five levels of intensity, seemed to reflect attituce 
toward authority in the military situation These 


items included competence of superior officers and 


and general favorableness of attitudes 
Air Force as an adi 


the re onscs 


instructors 


toward the nistrative unit. Scoring 


vith five levels of weighting, the symbolic 


authorit core from this instrument had a reliability of 
59. ‘Ten it reflected on attitudes toward members 
of the crew as peers and had a reliability of .73. The 
peer it were considerably more homogeneous 
topically than the Symbolic Authority ite These 
items presented attitudes toward actual peers in the 
working tuatio while the authority items were of 
such a general nature that we have classified them as 
attit towards Symbolic Authority 

7. Sociometric Questionnaire. The “Position Descrip 
tion Test" developed by the Crew Training Research 
Laborator re juired each crew member to pick those 
fellow cre embers whor he would like to be with 
in five hypothetical situations, including a crew party, 
shared work responsibility, bailing out behind enemy 
li Fach pers ittitude toward , vas 
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based upon the relative proportion of his mentions 
given to the aircraft commander. (Aircraft commanders 
as respondents were left out of this analysis, reducing 
the total number of cases to 138.) Peer scores for the 
officers were based upon the nominations given the 
other officers; for the enlisted personnel, the score was 
based upon the responses given to the other enlisted 
persons. For boss, the reliability was .75. For peer, the 


values were .84 for officers and .90 for enlisted men 


RESULTS 


Table 1 presents the product-moment cor- 
relations among these 20 variables. The vari- 
ables are arranged according to the a priori 
attitude groupings and the expected pattern 
of correlations. The a priori expectations were 
of generalized attitudes among the measures 
dealing with father, symbolic authority, and 
boss and also homogeneity of responses among 
the various measures focused on peers and 
symbolic peers. A relative independence be- 
tween these two major sets of attitudes was 
exper ted 

The ubiquitousness of “apparatus factors’ 
(9) is indicated by the fact that for 15 of the 
20 variables the highest correlations were ob- 
tained between measures of different attitudes 
based upon the same instrument. For only 3 
of the 20 variables is the highest correlation 
“same” attitude 
through a different This “appa- 
ratus factor” effect is particularly marked for 


with another measure of the 
instrument 


the Interview, TAT, and Sociometric measure. 
In Table 1, 


instrument have been put in parentheses so 


correlations involving the same 


that they may be discounted in the interpre- 
tation. It is obvious that in general the other 
correlations are considerably lower than ex- 
pec ted 

In spite of the disappointingly low genera 
there is a slight general 


level of correlation, 


positive trend. Disregarding the intra-instru 
ment coefficients, the values range from 28 
to +.56. Of the 169 coefficients not involving 
the same instrument, 143 are positive, differ 
001 
level, as tested by the binomial expansion. 

this 


attitudes toward 


ing from a chance distribution at the 


lo examine matrix for evidence of 


speci father, boss, peers, 
etc., Table 2 has been prepared from Table 1 
the 


eliminating 


by averaging by z transformation cor 


relations within each of the 
the 
weighting by number of cases (which reduces 
the the 


If speciti attitudes exist. 


cells, 


those based on same instrument, and 


influence of interview measures) 


correlations among 
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rABLE 3 


COMPARISON OF CORRELATION Parrs TO TEST THE 
ASSUMPTION OF STIMULUS GENERALIZATION FROM 
FATHER TO Boss 


Measure of Boss and 
Peer Attitudes 


Father Corr © 
Measure With a Core 
cg 
) S 
Int oO Al x 
Int Boss 64 19 06 19 \& 
Father Peer 65 27 O4 04 
S&oO Boss Os 23 14 06 14 
Father Peer 09 21 00 O8 
A. I Boss 15 09 O8 03) \% 
Father Peer 24 03 15 OSs 
12 


variables focused upon the same attitude 
object should be higher than correlations 
among measures involving dissimilar attitude 
objects. The data in Tables 1 and 2 indicate 
that this requirement is clearly met only for 
attitudes toward father and toward peers. 
It is most clearly not met for symbolic peers 
and boss. Even where met, the level of inter- 
correlation is much lower than one might 
have expected. . 

In attempting to interpret these findings 
it is well to look more closely at the assump 
tions involved in giving the different measures 
similar labels. In anticipating some equivalence 
among the various measures of attitude to 
ward father we assume response generalization 
or response equivalence to a relatively con 
stant, identifiable stimulus. The same assump 
tion is involved for the measures of attitude 
toward boss and peer, with the exception of the 


scores taken from the autobiographical 


inven 
tory, in which the measure is based upon re 
sponses to a number of previous bosses and 
previous peers. Removing the autobiographi 
cal inventory measures for boss and peer 
should thus improve the extent to which they 
refer to a single attitude object. So doing 
raises the average correlation among the 
measures of peer attitude about .05 points 
and has a similar effect upon the correlations 
among measures of attitude toward boss 
However, even with this alteration, measures 
of attitude toward boss still fail to meet the 
simpie requirement that the measures cor 
relate higher among themse!ves than with 


measures fo used on other attitude objec ts: 


even with all autobiographical inventory 


measures removed, measures of attitude to 
ward boss correlate more highly with measures 
of attitude toward peer than they do with 
other measures of attitude toward boss. 

The other assumption employed in the 
labeling of the attitude measures is that of 
stimulus generalization. It is on the basis of 
assumed stimulus generalization that the 
symbolic peer attitude measures were given 
similar labels, and that the symbolic authority 
attitudes were placed in the same cate 
gory. On the basis of one of the most wide 
spread hypothe ses in the psychology of per- 
sonality we also assumed generalization from 
father to symbolic authority and to boss (1, 
6, 7). In Table 2, it is clear that the three 
presumed groups with respect to attitude 
toward authority (father, symbolic authority 
and boss) correlate no more highly with each 
other than they do with the two sets of peer 
measures. Thus while we did find some evi 
dence of specific attitudes toward father, we 
find no evidence at all for a generalized “atti 
tude toward authority” which would include 
in its manifestations attitudes both toward 
father and toward persons in supervisory 
positions. This finding is so out of keeping 
with general theoretical anticipations that it is 
worth examining in greater detail 

rhe comparisons just made are somewhat 
unsatisfactory because of the variability or 
heterogeneity of the correlation coefficients 
within the blocks averaged. Not all of the 
attitudes have been measured with all of the 
instruments, and the instruments differ in the 
strength of apparatus factors. In addition, 
stimulus generalization or habit similarity 
may be expected on a priori grounds, involving 
social objec ts sharing the same temporal and 
spatial situation, and leading in particular to 
factors centering around the early home, on 
the one hand, and the present working situa 
tion on the other. The correlation between 
peer and boss attitudes might, for example, 
be due to the fact that they are both specih 
manifestations of a more generalized attitude 
toward the current working situation. To 
achieve comparisons in which these factors 
are held constant, we have extracted from 
Table 1 specific pairs of correlations that are 
comparable with regard to the instrument and 
the situational! factors involved. Some of these 


correlation pairs are shown in Table 3. All 
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of the pairs of coefficients in this table repre 


sent correlations of some one measure of 
attitude father 
boss and peer attitudes from the same instru- 
ment. The boss and peer attitude measures 
are comparable, inasmuch as they represent 
the same environmental factor and are diag- 


the same instrument. They differ 


toward with measures of 


nosed by 
only in their predicted similarity to father. 
As indicated in Table 3, interview attitude 
father correlates .64 
65 with interview 
Interview attitude 
19 with Description 


toward with interview 
attitude tov 
attitude 
toward father correlates 
of Self and Others attitude 
and .27 with Des niption of Self and Others 


Each of the pairs 


ard boss, and 
toward peers 


toward boss, 


attitude toward peer, etc. 
is comparable in terms of the extraneous fa 
and situational 


tors of instrument 


sources of variance. If there is stimulus equiv- 


type 


alence between father and boss, the correla 
tion with boss within every pair should be 
higher than the correlation with peer. Similar 
sets are presented for each of the three ap- 
proaches to measuring attitudes toward 
father. The column headed 
the proportion of instances in which the ex 


only 


“score” indicates 


pected relationship was obtained—in 
three pairs out of the twelve, over all. Thus, 
not only do these carefully selected compari 
sons fail to support the existence of stimulus 
generalization father to 
they a trend approaching statistical 
significance 

In a manner similar to the analysis in Table 


from boss but also 


snow 


in the opposite direction. 


3, a number of other comparisons are possible. 
A higher correlation may be expected be 
attitudes toward father and attitudes 

symbolic authority than between 
ird father and attitudes toward 
symbo PAT 
the TAT scored objectively, and the 
Judging Test all 


toward s\ 


tween 

toward 
attitudes tow 
scored globally, 
Photo 


yield measures of attitudes 


pec rs The 


mbolic authority and symbolic peers 


itions between these two variables 


If correl: 
ind the three measures of attitudes toward 
father used in Table 3 are compared, a success 
ful prediction is achieved in only 4 out of 9 
Cases. 
Throug! 
ing addi 
potheses were tested, with the outcomes indi 


analogous procedures the follow 
tional stimulus generalization hy 


symbolic authority and boss should 


cated: 
correlate higher than symbolic authority and 


peer (10/20); boss and symbolic authority 
should correlate higher than boss and sym 
bolic peer (7/12); peer and symbolic peer 
should correlate higher than peer and sym- 
authority (8/15); symbolic peer and 
peer should correlate higher than symbolic 


peer and boss (7/12). Needless to say, none 


bolic 


of the outcomes approaches statistical sig 


nificance 


DISCUSSION 


At the inception of this study it was an 
ticipated that persons bring with them into 
any social situation a repertoire of attitudes or 
habits which modify their interaction with the 
persons they meet. The literature of social 
psychology and personality supports the ex- 
pectation that prominent within this reper- 
toire of habits or generalized attitudes are 
attitudes toward authority derived from pre 
vious encounters with authority, especially 
in the early family situation this 
kind of anticipation is an assumption of a 
symbolic similarity between father and other 
kinds of authority and a generalization of 
attitudes from one to the other. With respect 
to the existence of any such generalized atti 
tude authority, independent of a 
still general attitude toward 
or a still more specific attitude toward specific 
boss, this study has been 


Basic to 


toward 
more people, 
father or specifi 
entirely negative. Furthermore, it has shown 
no evidence of stimulus equivalence between 
symbolically represented authority figures 
and actual authority figures or symbolically 
figures actual peer 


represented and 


figures. These findings are so out of keeping 


peer 


with the general assumptions of current social 
and personality psychology of all schools that 
they are worth some special consideration 
Che simplest interpretive device in situa 
tions such as this is to reject the appropriate 


When 


struments are employed, one at a time, this 


ness of the instruments used the in 
approach is usually convincing, for in the 
application of any particular instrument there 
are many reasons why it is not wholly appro 
priate to the task at hand. In a multi-instru 
ment approach, such as essayed here, it was 
certainly anticipated that some tests would 
prove better than others. However, the gen 
eral level of confirmation of the hypotheses 
was so low that discriminations among the 
tests as to their efficacy or appropriateness 
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cannot be made. If tests are to be rejected, 
they must be rejected collectively. While the 
writers cannot claim any special excellence 
for the instruments employed, they do feel 
that the 
range of approaches unlikely to share com- 


mon faults, and that collectively the battery 


selection represents a considerable 


is probably representative of the types of 
approaches that students of personality or 
social psychology would be likely to assume 
to be relevant for the problem studied. Re 
jecting the instruments is appropriate only 
if one is prepared to suggest a number of other 
instruments that would be superior and to 
specify the kind of application in which one 
Also, a reje 
tion of the instruments is in itself a modifica 
tion of the 
or limits the kind of situations in which the 
applicable. There 


would anticipate confirmation 


theory, inasmuch as it restricts 


theory is expected to be 


may be grounds for questioning 


the adequacy of the conditions of test ad- 


stronger 


ministration, although the moderate obtained 
reliabilities are evidence against this position. 
But more 
desirable testing conditions would be advan- 


certainly parallel studies ‘under 
tageous 


[f one does not reject the results on these 
two grounds, then must one reject the theory? 
It is unlikely that 


would have any 


recommendation 
current 


suc h a 


great impact upon 
social psychological thinking, so it Is probably 
better to speak in terms of modification of the 


theory in the direction of restricting its appli 


cability. A sensible interpretation might be 
that although this kind of symbolic equiv 
alence or stimulus generalization is appro 


priate to the autistic thought processes in 
norma! dream states and in the waking states 
of extreme neurotics and psychotics, it does 
not interfere with the waking perceptions ol 
such as constituted the 


normal individuals 


test population for the present study. This 
seems the most reasonable interpretation to 
the investigators, even though one would have 
expected some slight carry-over of autistic 
thought tendencies into our test situations 

he original predictions of this study seem 
consistent with the principle of stimulus gen 
eralization in conditioning theory. From Ges 
talt theory one would also expect generaliza 


tion based upon configurational similarity from 
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one authority situation to another. Within 
the framework of Freudian theory, our results 
might be considered less surprising. Freud 
posits the similarity of opposites, and if 
reaction-formation mechanisms were present 
in our subjects, attitudes toward boss and 
father might conceivably be reversed in these 
It seems to the writers, however, 
presentations, 


instances. 
that in usual psychoanalytic 
our original expectations are regularly enter- 
tained. Even though our simple measure of 
favorableness toward father may be eliciting 
expression of reaction formations rather than 
basic attitudes, one would generally antici- 
pate that the same reaction formation would 
be used by the person to conceal from himself 
and others his underlying hostility toward 
his boss; or that the same protestations of 
rebellion which the boy manifested toward 
his father would be repeated toward the boss, 
to cover, for example, persistent underlying 
dependency needs. To explore this possibility, 
we have searched the correlation matrix for 
possible evidence of greater agreement from 
depth measure to depth measure, and from 
surface measure, than 
between a depth and a surface measure. No 


measure to surface 
such evidence is to be found. Furthermore, 
the scatter diagrams for each of the coeffi- 
cients in Table 1 have been inspected without 
success for curvilinearity or more complicated 
patterns that concepts such as reaction forma- 
tion would predict. Only in a complex analy- 
sis of inter- and intra-individual variability 
have evidences of dynamic personality proces- 
ses been found. These analyses will be reported 
later and do not reduce the incongruity be- 
tween our present findings and common as- 
sumptions in the area of personality and inter- 
personal attitudes 


SUMMARY 
Presumed attitudes toward father, sym- 
bolic authority, boss, peers and symbolic 


peers were measured by interview, descrip- 
tive trait check lists, autobiographical inven- 
tory, thematic apperception test, judgment 
of photographs, an attitude survey and a 
sociometric questionnaire. Intercorrelations 
among the 21 measures so obtained are gener- 
ally low, except where two or more measures 


come from the same instrument. In such in- 
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HENEVER individuals are required to 

guess the occurrence of alternative 

future events without any specifi 
knowledge of what to expect, they probably 
ipon previously established expectancies 
with 
his 


rely 
As S 
these 
guess 
robability of oc 


gains more and more experience 


future’ events, he can modify 


ng behavior to accord with the “true” 


urrence of such events. 


The present investigators were interested in 


Ss’ initial guesses in a probability learning 
factors might in 


rhe 


Ss with minimal informa 


ask where motivational 


fluence Ss’ guessing behavior primary 


was that 


vould overguess the 


hypothesi 
frequency of occur 


rence of stimulus events when those events 


had been previously associated with signs of 


goal attainment or  need-satisfaction \ 


secondary purpose Wf these studies was to 


investigate some semantic aspect ol Osgood’s 


} 
mediation theory (9 
The 
tially in 
that 


primary hypothesis was derived ini- 


the following way. First let us assume 


ore timulus event has been directly 


associated with ygoal-reaching behavior or 


need-satisfaction; i.e., it has become a sign 


j 7) 


faction and con epts related to it. 
that 


for nee 


Further let us a 


that signify 


sume stimulus events 


goal-reaching and need-satisfac 


tion tend to be perceived as larger, brighter, 


dense! more numerous than they really 


are (1, . ae 


stimuli that are 


are dealing 
with 


13). That is, we 


wit! isses of associated 


need-satistaction, and we consider signs of 


faction to be accentuated in 


Now. il 


Signs, 


need il per 


ception neutral stimuli are associated 


t} 


with these acquire the 


they might also 


This research was completed while the authors were 
Menninger I 


Fellows of the 


oundation as USPHS Postdoctoral 
National Institute of 
Mental Health. The study was partially supported by 
a research grant (M-715C) from the National Institute 
of Menta! Health, of the National Institutes of Health, 
Public Health The h to 
Drs. Gardner Murphy, Georg 


at The 


Clinic Research 


Servic authors wis thank 
Gard 


the 


Stern, and Riley 


suggestions in the preparation ol 


same perceptual properties as the signs, and 
be perceived as larger, brighter, denser, or 
more numerous than they really are. It would 
seem to follow, then, that if a neutral stimulus 
is thus perceived as more numerous, for ex- 
ample, S may overguess the frequency of its 
occurrence in probability learning situations. 
In the present study certain stimulus events 
were selected that could be assumed to be 
related to need-satisfaction or positive value. 
In order to choose such stimulus events, 
reference was made to factor analytic studies 
of meaning reported by Osgood and Suci 
(10). These studies were based upon correla- 
tions among several bipolar scales, the ends 
of which are defined by adjectives such as 
good-bad, strong-weak, etc. Of the three factors 
extracted—evaluation, potency, and activity 
the evaluative factor reflect 
meanings usually associated with need-satis- 


appeared to 


faction or positive value, since such scales as 
good-bad, pleasant-unpleasant, etc., had high 
loadings. For the purposes of the present ex- 


periment the three adjectives near (loaded 
high (.59), and 
bright (.69), were selected as signs of goodness, 
pleasantness, and need-satisfaction, and their 
far, low, 


41 on the evaluative factor), 


opposites, and dark, were selected 
as signs of unpleasantness, et« 

An experimental manipulation correspond 
ing to each of these pairs of adjectives was 
also selected—an object moving toward and 
away from S for near-far, arrows pointing 
up and down for high-low, and a light going on 
and off for bright-dark. The adjectives and 
their corresponding manipulations are pre- 
sented in Table 1 together with the predictions 
about guessing behavior 

In summary, the relationship between the 
adjectives and positive value is based upon a 
factor analysis of meaning by Osgood and 
(10). The assumption that the experi- 


mental manipulations reflect the 


Ssucl 
meanings ot 


the corresponding concepts is partially in- 


ferred from previous experiments relating 


such manipulations to conceptual meaning 
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TABLE 1 
OUTLINE OF THEORETICAL Mopet LEADING TO PREDICTIONS IN Four EXPERIMENTS 


Semanti 


Experi- Experimental (Adjective 
ment Manipulations Code 
I, Il Object Moving Toward S Near 
Object Moving Away from S_ Far 
Ill Arrow pointing up High 
Arrow pointing down Low 
IV Light Turned On Bright 
Light Turned Off Dark 
JAKE CLEM 
EXP. 1,0, 
LEO OTTO 
ExP 


Fic. 1. Prormes Usep in Four Experiments 


(12). The experimental manipulation is 
thought to represent a meaning similar to 
that of the corresponding adjective, e.g., 
“movement toward” signifies ‘“‘near,”’ and 
the manipulation is thus related to need satis- 
faction and positive value via the concept’s 
semantic relation to similar pleasant and 
positive meanings. 


METHOD 


Subjects. In Exp. I, the 17 Ss were housewives, 
businessmen, secretaries, and psychologists. Nine 
Ss were assigned to a condition in which objects were 
moved toward S when one profile appeared and 
away from S when another profile appeared. The 
remaining eight Ss were assigned to a counterbalanced 
condition in which the movements toward and away 
corresponded to the opposite profiles. The objects were 
never given to S but were merely moved toward him 
In Exp. I the Ss were 25 female secretarial students, 
18 to 21 years of age, from a local business school 


Semantic Associations Predicted 
with Code Perceptual Guessing 
(Osgood & Suci) Expectancy Behavior 
Good, Pleasant, Valuable Many Overguess 
Bad, Unpleasant, Worthless Few Underguess 
Good, Pleasant, Valuable Many Overguess 
Bad, Unpleasant, Worthless Few Underguess 
Good, Pleasant, Valuable Many Overguess 
Bad, Unpleasant, Worthless Few Underguess 


Five were assigned to each of four experimental 
conditions and five to a control group. In Exp. III the 
Ss were again 22 female secretarial students, 8 of whom 
were assigned to one experimental condition and 8 to 
the other; a control group of 6 Ss received no experi 
mental treatment. In Exp. IV the Ss were also 18 
female secretarial students, nine of whom were again 
assigned to each of two counterbalanced conditions 
The Ss in each experiment were randomly assigned to 
conditions. 

Materials. In Exp. I thirty 3 x 5 index cards were 
used, on each of which appeared a schematic drawing 
of a right-pointing face. There were two such profiles; 
one was called “Clem” and the other “Jake’”’ (see Fig 
1). Each profile appeared on 15 of the cards, arranged 
in a different random order for each 5. In the experi 
mental procedure one profile was associated with an 
object moving toward S and the other with an object 
moving away from S. The objects used in these move 
ments were quarters, nickles, pennies, salt shakers, 
paper clips, hairpins, buttons, and burnt matches 

Experiment II involved the same cards used in 
Exp. I. Only one coin, a half-dollar, and one nonmoney 
object, a burnt match, were used in the movements 
toward and away from 5 

In Exp. LII two decks of 30 index cards with similar 
profile drawings were used. On 15 cards in each deck a 
profile called “Otto” was drawn and on the other 15a 
profile called “Leo” was drawn (See Fig. 1). Again the 
two profiles were randomly mixed for each S. In one 
set of cards, Deck A, an arrow pointing up was drawn 
on cards bearing the Leo profile and an arrow pointing 
down on cards bearing the Otto profile. In the second 
set, Deck B, the direction of the arrow was reversed 
for Otto and Leo. 

Experiment IV involved the same decks used in 
Exps. I and LI. A small wooden box containing a seven 
watt light was used as an association stimulus, the 
light being turned on and off with a foot pedal. The 
light was turned on for one profile and off for the other, 
and the association of a profile with “light on” and 
“light off” was reversed in a counterbalanced condition 

Procedures. Each experiment consisted of a training 
series and a guessing series. In the training series S was 
told to watch while Z turned the cards over one at a 
time, exposing a profile and saying its name aloud 
Then in Exp. I and II £ moved an object toward § 
during the exposure of one profile and away from S§ 
during the exposure of the other. In Exp. III E merely 
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turned the ards over, named the profile aloud, and 
went to the next card. In Exp. IV E turned the 
light on whe ne face was exposed and off when the 
othe ce appeared 

In the guessing series of each experiment EZ shuffled 
the deck then instructed to guess which of the 
twe ‘ ild appear on the next card. When S 
had made his guess, & would turn the card over ex 
posing the profile, and S would make a guess about the 
next card. The same deck was used for both the training 


and guessing phases of each experiment except Exp 
III, in wh 


but without arrows was 


case a deck containing the same profiles 


used for guessing 


RESULT 


An analysis of variance was 


/ vper ment / 


done on the number of guesses each S made 
for the profile associated with “toward” 
movements. This 2 x 2 analysis tested whether 
the movement of “money” objects had a 
greater effect than the movement of “‘non 
money” objects and whether there were any 


differences between profiles. The results indi 


cated no significant differences between money 


and nonmoney, between the effects of the two 
profiles, or for the interaction of the two vari 
able 

Since there were no significant differences 
among these experimental variables, results 
for the different groups of Ss were pooled to 
test for differences between guesses about 
toward- and away-associated profiles. The 
two means and the /test between them are 
shown in Table 2. This table indicates that 
Ss overguessed the occurrence of toward 
associated profiles at the .01 level of conh 
fidence. Fourteen of the 17 Ss tested were 
in the predicted direction with only three in 


the opposite direction 


Ex periment I] 


The analysis of this study 


that ol Exp I. Sixteen Ss overguessed 


paralle 
rABLE 2 
FreQut rF GUESSES FOR PROFILE ASSOCIATIONS 
RELATI ro CONTRASTING MANIPULATION 
Expe tal Manipulation Mean 7D ‘ p 
Expt. 1 (4 l 
lov ara 16 SS ‘ 
JOS 3. RR < 0 
Awa 13.12 i 
Expt. Il V ”) 
I ard 16.00 
788 2.5 O5 
Aw 14.00 ‘ 
Expt. TIL (N 16 
Arrow-uf 16.51 | 374 | 3.01 01 
Arrow-dow! 13.09 
Expt. 1V (N 18 
Bright 16.33 1.073 2.48 0s 
Dark 13.6 
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the profile associated with toward-movements, 
3 Ss underguessed, and there was 1 tie. Again, 
an analysis of variance on the number of 
the profiles 


significant 


guesses for toward-associated 


showed no difference between 
movement of money and nonmoney objects, 
between profiles, or for the interaction of 
money-nonmoney and profiles. Again results 
were pooled over these experimental condi- 
tions to test differences between toward- and 
The means for these two vari- 
ables and the / test between them are shown 
in Table 2. The profile 
was significantly overguessed at the .05 level 


away profiles 
toward-associated 


of confidence. There was no significant dif- 
ference between the number of guesses for 
the two profiles in the control group. 
Experiment IIT. Since no significant differ- 
ences were found in a / test between ‘‘Leo”’ 
“Otto” profiles, data for both groups 
were pooled to test the significance of the dif- 
“arrow-up” and “arrow- 
down” The ¢ test between these 
two means (Table 2) is significant at the .01 
the overguessing the 
“arrow-up”’ Twelve Ss 


and 
ference between 

associations 
level in direction of 
associated profiles. 
overguessed the frequency of occurrence of the 
profile associated with the arrow up, 2 Ss 
underguessed it, and there were 2 ties. There 
was no difference the 
number of guesses for the two profiles in the 


significant between 


control group 


Experiment IV. Again no significant dif- 
ferences were found between the “Clem” and 
“Jake” profiles, and the data for the two 


counterbalanced profile groups were pooled 
to test the effect of “light-on” and “light-off” 
associations. The means for these two asso 
ciations are shown in Table 2 with the corre- 
Again it was found that Ss 
the occurrence of 


profiles associated with the light-on condition. 


sponding ¢ test 
significantly overguessed 


Ten Ss overguessed the profile associated with 


“light-on,” 4 Ss underguessed it, and there 


were 4 ties 
DISCUSSION 


A discussion of the results and their impli- 
take Let us 
consider the results first in terms of the pos- 


cations can several directions. 
sible operation of autistic factors and then in 
terms of their 
mediation theory of meaning (9 


The that Ss 


implications for Osgood’s 


results indicate made more 





re 
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guesses for profiles associated with movements 
toward them, arrows pointing up, and lights 
turning on These results are interpreted as 
indicating that the perceptual expectancy 
for the occurrence of prohies has shifted from 
equal expectancy, as shown in the contro 
group, to a preponderance for the positively 
associated profiles in the experimental groups 
If one considers perceptual expectancy to be 
related to cognitive processes, then this inter 
pretation is similar to the definition of autism 
given by Levine, Chein, and Murphy (7 
“Autism is the movement of cognitive proces 
ses in the direction of need-satisfaction.” 

Che movement of cognitive processes in the 
present experimental situation refers to the 
shift in perceptual expectancies as reflected 
in the preponderance of responses for posi 
tively-associated profiles This “movement” 
differs from the type considered in previous 
autism studies (6, 7, 11), in which attempts 
were made to elicit perceptual shifts in the 
direction of rewards and away from punish- 
ment The above conception of autism does 
not completely describe the results of the 
present experiments, since it is difficult to 
relate “arrows pointing up” and “‘lights 
going on” to need-satisfactions directly. In 
order to describe the results adequately in 
terms of an autism theory, a broader concep 
tualization is required. Some qualification of 
the phrase “in the direction of need-satisfac 
tion’”’ seems called for. One possible reformula 
tion might define autism as the movement of 
cognitive processes in ambiguous situations 
in the direction of meanings or interpretations 
which have positive value for the organism 
(Ot course operational speciication of the 
conditions under which cognitive processes 
“move” and what is meant by “movement” 
in individual experiments should also be made 
In the present studies the movernent is identi 
fied with a shift in response frequencies in 
favor of positive associations 

An alternative and not necessarily conflict 
ing interpretation of the present results can 
also be made in terms of differential emphasis; 
i.e., movements toward S, arrows pointing 
up, and lights going on might be considered 
to have a greater emphasizing effect than the 
opposite manipulations However, the use of 
an explanation in terms of emphasis requires 


some basis for differentiating the relative 


ow 
tn 


emphatic value of arrows pointing up v.s 
arrows pointing down, etc. The hypothesis 
nevertheless seems amenable to experimental 
appra sal 

If it is correct to conceive of the association 
stimuli used in the present experiments as 
translations of semantic codes into experi 
mental manipulations, then the results also 
support Osgood’s mediation theory and are 
consistent with his factor analysis of meaning 
(10 Ihe evidence presented here suggests a 
behavioral reality for the semantic relation 
ships obtained from Osgood’s judgmental 
studies. Thus, when a scale has a significant 
loading on the evaluative factor, for example, 
the verbal and perceptual signs associated 
with it appear to function evaluatively in be 


h ivior 


SUMMARY 


As a test of predictions derived from Os 
good’s (9) theory of meaning and Murphy’s 
(8) theory of autism, four experiments in 
guessing behavior were conducted. Adjectives 
highly correlated with goodness and positive 
value (10) were selected, and translations of 
these semantic codes into experimental manip 
ulations were specified. Movements of objects 
toward and away from S were chosen to 
correspond to near-far, arrows pointing up 
and down to high-low, and a light going on 
and off to brighi-dark. A random series of two 
profiles was presented to S, each profile 
being associated with one of the experimental 
manipulations. Later a different random series 
of the same two profiles was presented, and § 
was asked to guess the order in which they 
would appear. Profiles associated with opera 
tions reflecting the adjectives near, high, and 
bright were significantly overguessed while 


their opposites were underguessed 
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ACCURACY OF INTERPERSONAL PREDICTION AS A FUNCTION 
OF JUDGE AND OBJECT CHARACTERISTICS! 


BELA O. BAKER anp JACK BLOCK 


Institute of Personality Assessment and Research, University of California, Berkeley 


HE processes involved in identifying the 

reactions and characteristics of other 

people have been of persistent interest 
to psychologists for both theoretical and prac 
tical reasons. Many recent studies in this area 
have employed the technique of interpersonal 
prediction to secure an objective measure of 
accuracy of understanding. Largely neglected 
in such studies has been the influence of the 
person whose behavior is predicted, although 
it is widely acknowledged that the nature of 
the social object is an important determiner of 
accuracy. 

The chief aim of the present study was to see 
whether the level of accuracy achieved in an 
interpersonal prediction task is predictably af- 
fected by systematically selecting the objects 
of prediction. To this end the objects used were 
selected to represent three levels on each of 
two dimensions expected to be relevant to ac- 
curacy of prediction 

rhe first independent variable was favorable 
ness of self-description, by which is meant the 
degree to which an individual portrays himself 
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States Air Force under Contract No. AF 18 (600)-8, 
monitored by Technical Director, Detachment #7 
(Officer Education Research Laboratory), Air Force 
Personnel and Training Research Center, Maxwell 
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Air Force or of the Air Research and Development 
Command 
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design and presentation of this research. We are 
grateful especially to Drs. H. G. Gough, R. B. Holt, and 
F. Vanasek who made detailed criticism of an early 
draft of this manuscript. Many of their suggestions 
have been incorporated in this final version 


in a socially desirable light. This variable has 
been noted previously by Spanner (16) and by 
Stone and Leavitt (17) as influencing accuracy 
scores. Spanner found that the most self-criti- 
cal of four volunteer social objects was less 
accurately predicted than the other three. 
Stone and Leavitt employed what they term 
an “intermediary key” to partial self-favor- 
ableness out of the correlation between pre- 
dicted and actual responses to a personality 
questionnaire. They found that both general 
level of accuracy and generality in the sense of 
correlations between scores achieved for dif- 
ferent objects were decreased. Their explana- 
tion is that a judge who tends to attribute 
favorable self concepts to other people will 
score a number of adventitious hits when pre- 
dicting the responses of an object who de- 
scribes himself favorably, regardless of the de- 
gree of actual understanding he has of the 
object. These findings clearly suggest the hy- 
pothesis that social! objects who are moat self- 
favorable will be most predictable while social 
objects who are least self-favorable will be 
least predictable. 

The second independent variable studied 
was ego control. Ego control refers to the indi- 
vidual’s characteristic means of modulating or 
coping with his internal needs or drives. Sub- 
jects characterized by three kinds of ego-con- 
trol systems were represented in the present 
study: Over-controlled (OC), Appropriately- 
controlled (AC), and Under-controlled (UC) 
Descriptions of the behavioral properties of 
these personality systems are given elsewhere 
(1, 4, 5, 6). Summarizing briefly, OCs bind 
their tensions excessively and are relatively 
constricted, categorical, and repressing indi- 
viduals; UCs tend toward direct and unmodu- 
lated expression of their impulses and are rela- 
tively expansive, distractable, loosely organ- 
ized individuals; and ACs selectively bind and 
gratify their needs in accord with the social 
reality in which they function. 

If it is.assumed that the greater the consist 
ency or channelization of an individual’s be- 
havior, the more completely he is predictable, 
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then the rank order of accuracy scores tor 
these three groups of objects can be predicted 
(x who are conceptualized as deveioping 
relatively set ways of behaving should be 
most predictable; UCs, who are characterized 


as manifesting fre inconsistencies, should 


table; while ACs should be pre 


quent 
be least predi 
dictable at an intermediate level 

made concerning 
of judges « lassified in terms of ego 


that OC and Ut 


A further hypothesis wa 
the a curac' 


Our expectation was 


idges would be less accurate than Af judges 
Qur assumption in making this prediction was 
that two components are required for accurate 
oolal prediction, sensitivity of social percep 
tion, and the ability to integrate perceptions 


into a formulated prediction scheme. OCs are 
excluding of the 


UCs 


relatively 
field 
their inability to establish structure or a frame 


considered to be 


potential social while because ol 


of reference for themselves (4), are relatively 


unable to construct a coherent basis for pre 
diction. Since ACs possess both required char 
icteristics, the’ hould be superior to bath 
()C s and L'¢ 
\leTHOD 
Stimulus material available to judges. In the Spring 
of 1953, 100 Air Force captains were studied during a 


Institute of 
A standardized 


three-day assessment program at the 


Personality Assessment and Research 


stress interview, conducted by a grim and unresponsive 


examiner, was the iniuial procedure experienced by eac h 


officer. The interview was conducted under floodlights 


belore a group of observers with a sound movie camera 
operating* in plain view of the subject. The schedule of 
questions tarted witl How does it fee! to be in the 
movies?” and moved on to such questions as, What 
sort of a person are you?,”’ “What do you consider to be 
your main personality assets?,”” and “What do you 
consider to be your main personality handicaps?” The 


questions elicited rather rich samples of the expressive 
and coping behavior of the subject 

Nine officers were selected from the pool of 100 
potential social objects on the basis of the considera 


tions indicated below and the films of the interviews 


served as the standard stimulu aterial in the study 


rhe films were approximately two and one-half minutes 


and were of good quality both visually and 


espect to sound reproduction 


in duration 
with 

Nature of the required prediction. Im the course of the 
officer ()-sorted a set of 54 adjectives 
himself. The task 


adescr ptlions 


assessment, each 


| 


along a 9-point scale so as to describe 


categories were 


the judges was to predict these self 


ake the judges’ task somewhat lighter, several 


ombined, reducing the sorts to a 


tDr B. Cline served as cameraman during 


these 


Victor 


interview 


5-point scale with the fixed frequency distribution: 2, 


10, 30, 10, 2 
coefhicient 


The accuracy score employed was the 
correlation yetween the judge’s predicted 
Q sort for an object and that objet t’s self-sort 

The self Q sort was chosen for prediction because the 
content of the movies appeared to be highly relevant to 
the task of evaluating the object’s self-percepts and 
because the prediction of some form of self-description 
used experimental task. An important 
that the 


variations in elevation and dispersion which Cronbach 


is a widely 


methodological advantage is response set 
(9) has noted as confounding the discrepancy score are 


eliminated by the use of a forced distribution 


The Objec ls 


Measurement of object self-favorableness. To assess 


self-favorableness, six psychologists were requested to 
sort the 54 adjectives to predict the self-description of 
characterizing himself in the most 
The average intercorrelation among the 


indicating 


an individual 
favorable light 
six maximally self-favorable sorts was .73, 
that a composite of these six Q sorts has an estimated 
reliability (Spearman-Brown correction) of .94 

This maximally self-favorable Q sort was employed 
self-sorts of the 190 


A high correlation implied an 


as a reference key with which the 
officers were correlated 
officer was describing himself favorably; a low correla 
tion that he was ascribing socially undesirable charac 
teristics to himself. Selection of social! objects in terms of 
self-favorableness was based upon this set of correla 
tions treated as scores 


At the 
end of the three-day assessment program, eight staff 


Measurement of ego control among the objects 


psychologists codified their personality evaluations of 
rhe 


eight separate Q sorts for each subject were combined 


each of the officers by means of a 76-item Q sort 


into one composite Q sort which thus represented the 
staff consensus as to the nature of the subject’s per 
sonality. The sample of officers was divided into equiva 
lent groups of 50 and, within each group, the consensus 
Q dex riptions were intercorre lated. These correlations 
between persons were then cluster-analyzed (18 

was found, hve ot 
The 


When evaluated 


A number of personality “types” 


which were observed in both 50 & SO analyses 


cross validated types were combined 
test 
cluster types solated by the inverse analyses proved to 


in terms of and personality scale measures, the 


and in a fashion 
their 


be reliably different among themselves 


compatible with the defining characteristics of 


rABLE 1 
BeTWEEN Se_r Q 


KEY FOR THI 


CORRELAT\LON 
F AVORABLENI 


SORT AND THE 


Nine SocraL OBTEectT 


Ego ( trol 

elf-favorablene ( Appr Under 

Cor priate Con 

t ‘ e tr i 

l iE 

trollin 

Yb je ons! Obje 

High Self-favorableness ov 61 4 
Middle Self-favorableness 42 43 6 
Low Self-favorableness OS O8 03 
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moda] Q sorts. A fuller explanation of the procedure and 
results of these analyses is pre sented elsewhere (2 

On the basis of the Q-sort descriptions and other 
measures characterizing them, three of the groups wer« 
designated as fitting the categories of over-, under-, 
and appropriate-contro! (Ns of 26, 26, and 21, respec 
tively). These groups constituted the populations from 
which the social objects were drawn 

Specification of the social objects. Representatives of 
the three ego-control groups were selected at each 
of three levels of self-favorableness, making a total of 
nine social objects. Table 1 shows the degree of match 
ing for favorableness within the three ego-control 
groups. The ages of the social objects varied from 29 to 
44, with a mean of 34.4. Their socioeconomic origins are 


best estimated as upper-lower or lower-middle class 


The Judges 


The sample of judges. During the winter of 1954-55, 
70 applicants to medical school underwent an 18-hour 
assessment t the Institute These assessees wert 
preponderantly undergraduate, although a number 
were graduate students or in the nonacademic commun 
ity. They ranged in age from 19 to 33, with a mean at 
22. Thus, the judges were substantially younger than 
the social objects. Socioeconomically, the group was 
essentially of lower-middle to middle-middle class 
ymewhat higher than the social objects 


origin, s 

Procedure with respect lo judges. The procedure was 
administered to groups of five judges at a time in 
two 40-minute sessions. Before viewing any films, the 
subjects Q-sorted the 54 adjectives so as to describe 
themselves. They were then instructed and four films 
were shown, with suitable intervals for the judges to 
indicate their predictions. After an interval of two 


hours, the remaining five films were shown. The films 


were presented in a standard order such that no social 
object followed immediately after another object from 
the same ego-control or self-favorableness group 


Measurement of ego control among judges. At the end 
of the assessment, seven stafi psychologists codified 
their personality evaluations of each subject-judge by 
means of a 115-item © set. The seven separate Q sorts 


judge were combined into one 


describing each subject-) 


composite Q sort representing the staff consensus as to 
the subject’s personalit) 
Iwo of the Q items that were evaluated by the 


assessors for each subject were, “Tends toward over 
control of his needs and impulses; binds his tensions 
excessively elays gratification unnecessarily,” and 


“His various needs tend toward relatively direct and 
uncontrolled expression; unable to delay gratification; 
acts out.”’ The relative weighting of these two items 
provided the basis for categorizing judges into one of 
the three ego-control groupings. For each subject-judge, 
his Q score on the under-control item was subtracted 
from his Q score on the over-control item. A distribution 


of difference scores was thus obtained where high posi 


tive scores imple 1 over-control and negative scores 
under-contro 

Four judges were eliminated from the sample ior 
variou reasor one because he grew up ina loreign 
countr ne because he was extremely deviant from 





the rest of the sar ple on a variety of assessment meas 


ures. a two because thev appeared to be long more 


clearly to an ego-control category not here used—that 
of bimodzg! control. The remaining 66 judges were 
divided into three groups on the basis of the distribution 
of difference scores. The 22 individuals with the highest 
positive difference scores were designated as the group 
the 22 individuals with the 
highest negative difference scores constituted the group 


of over-controlling judges; 
of under-controlling judges; and the 22 remaining sub 
jects in the midrange were classified as appropriately 
controlling judges 

Although the judges were categorized on the basis of 
only two Q items, other items which covaried with 
these two could be examined to give a more compre 
hensive picture of the differential characterizations of 
the groups. It was not possible to make a direct com 
parison of the (-sort descriptions of the judges and 
objects, since different Q decks were used. Inspection 
of the items differentiating among the three judging 
groups yields a subjective impression that the dimen 
sion involved is quite comparable to that distinguishing 
among the object groups.’ The OCs are in both cases 
described as relatively constricted, passive, conforming 
individuals; the UCs as more ascendant, expressive, and 
rebellious; and the ACs as more moderate individuals 
who share the desirable but not the undesirable traits 
of the extreme groups 


RESULTS 
Accuracy was scored by correlating pre- 
dicted () sorts with the appropriate object self 
sorts. The accuracy scores were then treated 
by an analysis of variance design‘ adapted from 


* Tables of the items significantly differentiating 
among UCs, ACs, and OCs in the samples of objects 
and judges have been deposited with the American 
Documentation Institute. Order Document No. 5035 
from ADI Auxiliary Publications Project, Photodupli 
tion Service, Washington 25, D. C., remitting in 
advance $1.25 for microfilm or $1.25 for photocopies 
Make checks payable to chief, Photoduplic ation Serv 
ice, Library of Congress 

* It should ge noted that there is a problem involved 
in applying analysis of variance to these data. For 
practical reasons it was necessary to present the stimu 
lus materials to the subjects in a fixed, rather than 
random, order. Consequently, the possibility that 
differences between object groups are confounded by 
order efiects must be considered. The order used 
placed one member of each object classification in each 
third of the presentation series. Thus the ordinal 
nositions of objects classified by ego-control were 
OCs—1, 6, 8; ACs—3, 4, 7; UCs-—-2, 5, 9. For objects 
classified by self-favorableness the ordinal positior 
were: High—1, 4, 9; Middle—3, 5, 6; Low—2, 7, 8 
Our trust was that this approximate balancing of 
serial position within groups would largely exclude 
order effects from the main comparisons and the simple 
interactions 

Since the correlations which comprised the raw 
accuracy scores appeared by inspection to form a 
reasonably symmetrical distribution, they were 
coded for this analysis by simply dropping the decimal 
point and adding a constant of 30 to remove negative 


numbers 
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rABLE 2 
ANALYSIS OF VARIANCE OF PREDICTIVE 
ACCURACY SCORES 
apes a Sum of | Error P 
quares | Term 
Between personality-style if 2 2,596.46) b 3.05° 
judges JP 
Error b 63 26. 843.58 
Between subjects ¢ 29,440.04 
Between personality-style of ob 2 4,909.12) wi 6.071 
jects (OP 
Between self-favorableness of ob 2 6, SOB. St we (94.487 
jects (OF 
Interac ” 
OP & OF (or OLD- Residual ir 4 151.08) we 
dividual differences among 
objects 
OP & JI 4 255.04, w 
OF » JP ‘4 958.04) we 1.24 
JP x OID a 687.53 we 1.68 
Error » O4 88, 205. Se 
Error » 12¢ 50,915.38 
Error we 12¢ 24, 383.2 
Error #5 252 2,906.93 
Within subjects 28 151,733 
Total So3 161.173. 81 
* For 2 and & degrees of freedom, F = 2.30. F « = 3.15 
t For 2 and 120 degrees of freedom F ou = 4, F on = 7.31 


one described by Lindquist (13, pp. 292-297). 
Table 2 summarizes the results. Two of the 
main effects were highly significant (p less than 
005), while the other was of borderline signifi- 
cance (p approximately .06 None of the inter 
actions was significant 

rhe social objects expressing the most favor- 
able self-descriptions were most readily pre 
dicte 1, 
for them being .42. The objects at the middle 
owest levels of self-favorableness did not 


the average accuracy score achieved 


and 
differ reliably in predictability. Indeed, the low 
favorableness objects were somewhat more pre- 
dictable than those in the middle range, aver- 
scores for the groups being, 


age accuracy 


respectively, .24 and .27 


Our hypothesis con- 
cerning the effect of self-favorableness on pre- 
dictability is thus partially supported and par- 
tially infirmed 


most accurately judged, the average accuracy 


over-controlling social objects were 


score achieved for them being .35. The appro- 
priate- and under-controlling objects, for whom 
average accuracy scores were .28 and .29 re- 
spectively, did not differ in predictability. Our 


hypothesis concerning this effect is thus also 
partially supported and partially infirmed. 

The AC judges performed at the highest 
level of accuracy, achieving an average score 
of .33, and the OC judges were least effective, 
achieving an average score of .28. The inter- 
mediate UC judges, with average scores of .31, 
were not significantly different from the judg- 
ing groups at either extreme. Our hypothesis 
concerning the judges’ relative accuracy thus 
received some support.® 


DISCUSSION 


Accuracy of prediction as a function of the 
judge. Many current theories of personality 
suggest that the ability to understand others 
is an important feature of good adjustment. 
From this point of view, one would expect that 
AC judges, who are conceptualized as respond- 
ing more adaptively to reality, would be more 
accurate than either OCs or UCs in their accu- 
racy of prediction. And in fact, this was the 
case, although the magnitude of the differences 
is small. Since this finding is in general agree- 
ment with previous results (eg., 8, 10, 12), it 
will not be discussed further except to point 
out that the result does not necessarily imply 
AC judges, or better adjusted judges, are more 
“understanding” of others. At this stage of 
knowledge, the result simply indicates that 
these judges are more accurate predictors of 
others, regardless of how that accuracy is 
achieved. 

The question can be raised at this point as 
to whether the various groups of judges dif- 
fered in the manner in which they made pre- 
dictions. In exploring this question a number 
of measures were defined, two of which are 
mentioned here because of their particular rele- 
vance. The first measure, labeled ‘“‘ascription 
of self-favorableness,”’ is an index of the degree 
to which favorable self-opinions are attributed 
to others, a measure which was defined as the 
correlation between the composite of an indi- 
vidual’s predictions and the favorableness key 


* It should be noted here that the observed level of 
significance for this hypothesis is an underestimate. 
‘Lhe over-all F-test, in the situation where a prediction 
is supported, provided too conservative an estimate 
of the true level of significance. An allowance for the 
ordering of the three groups as predicted is difficult to 
decide upon. A halving of the obtained p level (.06) is 
still somewhat conservative but is closer to the correct 
significance value 
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mentioned earlier. An analysis of variance of 
these scores indicated that there were no sig- 
nificant differences among the groups of judges. 
The second measure, tentatively called “‘self- 
generalization,” was defined as the correlation 
between the composite of a judge’s predictions 
and his own self-description. An analysis of 
variance of these scores again indicated that 
the differences among the groups of judges 
were not significant. The variations in accu- 
racy described above, therefore, cannot be ex- 
plained either in terms of variations in the 
tendency to ascribe favorable self-opinions to 
others or in terms of variations in the tendency 
to ascribe one’s own self-per« eption to others. 
Accuracy of prediction as a function of the 
object. The degree of favorableness with which 
the social objects described themselves was 
found to be strongly related to the accuracy 
with which the judges were able to guess their 
self-descriptions. However, our hypothesis con- 
cerning this effect was not fully supported, for, 
although the highly self-favorable objects were 
best predicted, the objects with middle and 
low degrees of self-favorableness did not differ 
in their predictability and in fact were ordered 
differently than had been anticipated 
One partial explanation of this result is to 
be found in the fact that all the judges tended 
to predict relatively favorable self-descrip 
tions, the average correlation between the com- 
posites of individual judge’s predictions and 
the favorableness key being .39, with a stand 
ard deviation of .15. Consequently, the degree 
to which an object’s self-description correlated 
with the favorableness key helped determine 
the accuracy with which it was predicted. Such 
a finding—which is not new with us (cf. 16, 17) 
is, upon reflection, to be expected from the 
context of prediction experiments. ‘There is a 
good deal of social pressure toward the sup- 
pression of unfavorable views of one’s fellow 
man. We may cite here the homily, “If you 
can’t say something good about a person, don’t 
say anything.” Some of our subject-judges 
may have been more than ordinarily aware of 
this pressure because of the fact that they were 
undergoing an assessment and may have felt 
that it would reflect ill upon them if they 
ascribed unfavorable characteristics to the ob- 
jects. Others may have reasoned (and not too 
inaccurately) that the objects themselves, 


since they were also assessees, would have been 


concerned about making good impressions at 
the time they sorted the adjectives. In general, 
the fact that the experimental situation was 
ambiguous and even a little threatening from 
the subjects’ point of view is likely to have 
promoted cautious—..e., favorable—responses. 

Beyond the common tendency toward cau- 
tion evoked by the role of being a judge, there 
probably are reliable individual differences in 
the extent to which judges adopt the approach 
of ascribing favorableness. These individual 
differences among judges may well be in them- 
selves fruitful objects of study. But the degree 
to which they should -be allowed to enter into 
measures of judging ability must be determined 
by the interests of the investigator. If the re- 
search focus is on accuracy of prediction per se, 
then all mediators of accuracy must be ac- 
cepted as relevant. If, however, the orientation 
is not toward predictive accuracy per se, but 
rather toward a notion of “interpersonal acu- 
ity,” “social sensitivity,”’ or “‘understanding,”’ 
then the judges’ tendencies to make favorable 
predictions may be interpreted as confounding, 
and this source of variance should in some way 
then be controlled or partialled out. 

De-emphasizing the effects of object self- 
favorableness may be accomplished in a num- 
ber of ways. Some form of partial correlation 
may be employed to eliminate the ascription- 
of-favorableness effect, as in the Stone and 
Leavitt (17) study, leaving a reduced score but 
one free of this particular component. But as 
perhaps the better technique, judges may be 
prevented from predicting favorableness by 
the stratagem of employing a forced choice pro- 
cedure where the alternatives are matched for 
social desirability, as in, for example, the Ed- 
wards Personal Preference Schedule (11) 

The influence of the objects’ ego-control 
style on predictability is not to be so neatly 
accounted for as is the influence of self-favor- 
ableness. Because our hypothesis concerning 
this effect was only partially supported, the 
result probably does not warrant extensive dis- 
cussion. While the greater predictability of 
OC objects supports the conceptualization of 
over-control previously advanced, the meaning 
of the equivalence, even slight reversal, of the 
predictability of AC and UC objects remains 
obscure 

Although the finding itself does not clarify 
the process of understanding, it nevertheless 
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demonstrates that predictive accuracy scores 


can be manipulated by varying an inferred 
central dimension. This demonstration takes 
on more force when one considers the really 


startling (at least to us) finding that the re 


sidual individual differences among objects 


i.e., all individual differences among objects 
over and above those accounted for by the 
main effects—failed to elicit any significant 


variance in accuracy of prediction. Thus by 


systematically varying social objects in terms 
of self-favorableness—-a dimension defined by 
explicit features of the objects’ self-descriptive 
postulated 


all the 


responses and ego-control-—a 
structural characteristic of personality 


variance in object predictability in this experi- 


mental situation has been exhausted 

While this finding impresses us as note 
worthy, we would hesitate to make too much 
of it until it 1s repeated Self-favorableness is 


an obvious and easily controlled characteristic 


of verbal des riptions, the relevance ol which 


is greatly amplified within the context of social 
prediction experiments. Ego controi is a per 
sonality dimension underlying a broad variety 
of behaviors. That the 
count for the bulk of the variance in predicta 
Even if is the 


case, however, it still does not exclude the pos 


two together should ac 


bility is not implausible this 
sibility that other dimensions overlapping with 
the ones used here may bear a significant rela 
tionship to predictability 

Favorableness, as indicated earlier, is a con 
veniently manipulated characteristic of verbal 
descriptions, whether of self or of others. Such 
is not the case with ego control as a determi 
nant ol pred tions. It might be possible to de 
rive a key from the present data to reflect those 
differences among objects attributable to their 


This key could 


used to partial out of the accuracy 


different ego-control patterns 
then be 
those differences among them attribu 


scores 
table to the objects’ evgo-contro: styles. But 
clearly, such an after-the-fact key wv ould have 


little weneral application since it could not be 
defined independently of the objects. Experi 
mental control over the effects of this variabie 
cannot be attained, therefore, by the expedient 
of using partial correlations 

This tact, ol 
studies of predictability and merely compli 


course, poses no problems ior 


cates the study of the mediation of accuracy by 


the judges re sponse tende ncies | or studies of 


judging ability, however, the indication seems 
to be that careful attention to the problem of 
sampling is essential if results are to be gen- 
eralized beyond the specific experimental situa- 
tion. In this respect, Brunswik’s thesis (7) ap- 
plies—the same considerations that govern the 
sampling of judges should apply to the sam- 
pling of objec ts 

Dimensional extremeness as a possible deter- 
minant of predictability. In the present experi- 
ment, the social objects in the midrange of the 
independent variables studied, i.e., the middle 
level of self-favorableness and the AC level of 
ego control, were least accurately predicted. 
In each case the difference in predictability 
between these middle groups and the extreme 
group which fell next in order of predictability 
was slight and nonsignificant. But the ranking 
itself makes implausible the assumption that 
there is an ordinal relation between predicta- 
bility and either of the experimental dimen- 
sions. We would expect that the inclusion of 
more extreme representatives of low self-favor- 
ableness and under-control among the social 
objects would make the curvilinear trend more 
pronoun ed. 

hese findings, in conjunction with our 
knowledge of the process of concept formation 
and the development of frames of reference 
(14, 15 that clear and 
unambiguous categories 
aids in prediction. Socially valid personality 


categories, informal though they may be, pro- 


judges seek 


suggest 


“stereotvpes’’) as 


vide an economical and integrative framework 
from which specific predictions may be made. 
“Likes like 
“rigid” are readily available or- 


ganizing principles with a good deal 


himself’’—‘‘doesn’t himself” or 
“impulsive” 
of social 
utility. 

he social validity of 


principles, however, stems from the behavioral 


these 


ategorizing 


consistency of individuals at extreme positions 
on the continua used. In predicting the explicit 
self-descriptions of self-ambivalent objects or 
in assessing the conditional behavior of appro- 
priate-controllers, simple notions of consist- 
ency no longer apply. Consequently, accuracy 
of prediction suffers 


SUMMARY 


A sample of nine social objects was selected 
so as to be systematically varied on two dimen 


sions: ego control and self-favorableness. Sixty- 
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Six subjects viewed sound movies of the be 
havior of each object in a stress interview and 
attempted to predic t his self () sort of 54 adjec 

tives. The judges were also classified on the di- 
mension of ego control. An analysis of variance 
of the accuracy scores indicated that signifi 

cant variance was associated with ego-control 
style of the judges, and with both ego-control 
style and self-favorableness of the social ob 
jects. Residual individual differences among 
the objects were not significant. The results are 
discussed in terms of their implic ations for the 
methodology of social perception studies, with 
emphasis on the importance of a carefully con- 
sidered selection of standard social objects 
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DIFFERENCES BETWEEN SCHIZOPHRENIC AND BRAIN-DAMAGED 
GROUPS IN CONCEPTUAL ASPECTS OF OBJECT SORTING 
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CHIZOPHRENIC thinking has been charac- 


terized by a number of investigators 
8 11, 14, 18, 20, 21, 22, 23) 


or a deterioration in ab- 


(e.g., 1, 2, 7, 
as an impairment 
ability, resulting in a 


Many of these same 


stra “very great con 
creteness” (7, p. 23) 
writers (e.g., 1, 2, 7, 11, 14, 23) have empha- 
sized that the extreme concreteness of schizo- 
phrenic thinking closely resembles that of the 
brain-injured 

It is the contention of the investigators that 
studies of conceptualization have been unduly 
influenced by Goldstein’s dichotomous formu- 
” The two “con- 


lation of “abstract-concrete 


ceptual levels,” abstract and concrete, were 
originally employed by Goldstein and Gelb to 
distinguish the conceptual behavior of brain- 
damaged patients from that of normal adults. 
When the term, 


plied subsequently by 


“concrete attitude,’’ was ap- 


Goldstein and other 


investigators to characterize also the concep- 
tual behavior of a number of groups other 
than normal! adults, the inevitable result was 
to crowd together a heterogeneous group of 
referents for the term, without apparent uni 
vocality 

In a previous report (16), the investigators 
suggested that the varied uses of the concepts, 
abstract and concrete, represent a fusion of at 
least 
amount of social agreement, or communality; 


four isolable conceptual variables: (a) 


(b) order (or degree) of conceptual classifica- 
tion; relative essentiality, or salienc y; and 
(d) the presence or absence of generic terms. 
In that study we attempted to demonstrate 
that 
clearly distinguished schizophrenic and normal 


yf these four meanings the one that most 


conceptualization was the relative amount of 
social agreement, or communality, for the 
concepts employed; on the other hand, hy- 

' Employed part-time at the Houston Veterans 
Administration Hospital, where this study was con 
ducted, 

*Stu 


Veterans 


conducted while employed at the Houston 


Administration Hospital 


potheses of group differences which were based 
upon the Goldstein dichotomous abstract-con- 
crete formulation were not supported in a 
single instance. 

In the present study, the hypothesis to be 
tested is that the second variable—order of 
conceptual classification—shows the greatest 
difference between schizophrenic and _ brain- 
damaged Ss. 


METHOD 


Subjects 


The major samples consisted of a group of 37 male 
patients diagnosed as chronic paranoid schizophrenic 
and a group of 34 male patients who had been diag 
nosed as suffering from types of disorder associated 
with the more general diagnostic term, brain-damaged 
The specific medica] diagnoses included encephalopathy 
or chronic brain syndromes associated with encepha 
litis, cerebral thrombosis or arteriosclerosis, central 
nervous system epilepsy, post-traumatic 
encephalopathy, Pick’s Disease, etc.? A group of 19 
additional brain-damaged male patients whose official 
diagnoses included references to psychotic reactions 
only to check reliability of 
scoring in the object-sorting procedure 

Means standard control 
variables of age, education, and estimated intelligence 
level are lable 1. The distribution of age 
and education shows close agreement; neither of the 
mean differences for these variables approached sig 
nificance. On the other hand, the significance of group 
mean differences in intelligence exceeds the .001 level 
Since intelligence could ; ot be controlled experimen 
tally, and since the results of a previous study (16) 
had indicated a high degree of association between 
intelligence and conceptual behavior 
variance was employed to provide a statistical control 
over the effects of intelligence upon sorting behavior 


syphilis, 


were used in this study 


and deviations for the 


shown in 


analysis of co 


The Sorting Task and Rationale for Scoring 
Conceptual Areas 


The sorting task used was the Rapaport modification 
of the Goldstein-Gelb-Weigi Object Sorting Test. The 
task requires 19 different sortings of a group of 33 
objects familiar to everyday experience. Administra 


* We wish to express our deep appreciation to Dr 
Ruth M. Hubbard and other staff members of the 
Waco Veterans Administration Hospital for their kind 
cooperation in making these records available 
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rABLE 1 
MEANS AND STANDARD DEVIATIONS OF DISTRIBUTIONS 
FOR CONTROL VARIABLES OF AGE, EDUCATIONAL 
LEVEL, AND EsTIMATED INTELLIGENCE 


Age Educatior intelli 
Experimental N B —s mae 


Group 
Mean; SD |Mean| SD |Mean SD 


Brain damaged [34] 40.1/10.5| 9.3) 3.2] 95.8/14.7 


Schizophrenx [37] 40 3} 9) 6) 10.7} 2 6) 108 9}15 l 
| | 


* Based upon verbal! IQ scores on the Wechsler-Bellevue Scale; 
for the schizophrenic sample, these scores were estimated from 


results obtained with the vocabulary subtest 


tion was in accordance with the procedure described 
by Rapaport (18). 

Each sorting was scored separately as occurring in 
one of four “conceptual areas.”” The basis for scoring 
sortings into the four conceptual areas has been de 
scribed in some detail in previous reports (15, 16). In 
summary, the rationale is as follows“ Two variables, 
order of conceptual classification and extent of social 
agreement, are isolated in the operations of this experi 
ment and are termed the “open-closed” and the “pub 
lic-private”’ variables, respectively. In previous studies 
these variables have been found to be unrelated to 
each other 

Let us consider, first, the open -« losed variable. The 


more attributes that are “bound-together” within a 
conceptual classification, the more “closed” the sorting, 


or concept, is considered to be. Thus, a concept based 


upon a single attribute common to all objects within 
the conceptual grouping would be maximally “open”’ 
(e.g., a group of various objects conceptualized as 
belonging to the same class because all are composed 
of metal). On the other hand, a concept binding together 
all perceptible attributes (all of the “degrees of free 
dom”) of the objects within a conceptual grouping 
would be maximally “closed” (e.g., a concept based 
upon the absolute likeness of two sugar cubes). It can 
be seen that the more open a concept is, the more 
abstract, or generic, or the higher the order of the 
principle of classification that is involved 

As for the second variable, “‘public-privateness,” a 
relatively high degree of publicness of a concept implies 
that the principle underlying the grouping is widely 
shared by and freely communicated to others within a 
social group. For example, the boundaries for a concept 
of “roundness” are relatively clearly denotable; the 
conceptual boundaries for a concept of “goodness”’ 
would be considerably less so; thus, the exact conceptual 
limits intended by the latter term are not easily com 
municated, nor are they easily anticipated by others 


*A revised set of subarea categories and supple 
mentary instructions for scoring conceptual areas as 
used in this study has been deposited with the American 
Documentation Institute. Order Document No. 5067 
from the ADI Auxiliary Publications Project, Photo 
duplication Service, Library of Congress, Washington 
25, D. C., remitting in advance $1.75 for microfilm 
or $2.50 for photocopies. Make checks payable to 
Chief, Photoduplication Service, Library of Congress 


If these two variables, with their respective poles of 
closed-open and public-private, are visualized as inter 
secting in space, the resulting quadrants may be con 
ceived to represent the following four “conceptual 
areas”: closed-public, open-public, open-private, and 


’ ’ 


closed-private. These four conceptual areas are the 
basic scoring categories. Examples of subcateguries and 
typical conceptual groupings within each of the four 
conceptual areas are as follows 

1. Closed public area 


a. Absolute identity 2 corks; basis: “exactly 
alike” 
5b. Species identity real and toy forks; basis 


both forks” 
2. Open-public area 
a. Unitary abstraction—(all metallic objects; basis 


“all made of metal’’ 

5. Superordination several toys; basis: “all these 
are toys” 

3. Open private area 

a. Loose-representational—(bicycle bell, cork; basis 
“vou shouldn’t drive when drinking’) 

6. Over-abstraction—(red ball, red paper circle, red 
matchbook, red sink stopper; basis: “manufactured 
articles” 

4. Closed-private area 

a. Species-object mediation—(bicycle bell, lock; 
basis: “put bell on bicycle and lock it” 

5b. Closed radial matches, several smoking objects; 
basis: “you need matches to light each of these’’) 

It can be seen that the determination of the appro 
priate quadrant, or area, for a concept requires a cross 
classification, in which the effects of both variabies can 
be taken into account with one notation. By summing 
the individual S’s scores in adjacent quadrants, two 
additional measures, “total public vs. private” and 
“total open vs. closed,”’ can be derived. These measures 
provide a notation of the effect of either variable, 
independent of the other 

One additional measure, an “autistic index,” is 
derived by subtracting the sum of closed-public con 
cepts trom the sum of con epts s ored as open-private 
While the dichotomous abstract-concrete formulation 
would usually lead to the indiscriminate classification 
of concepts within both of these areas as “concrete,” a 
relatively high negative relationship between the two 
classes has been found consistently.® 


Procedure 


The object-sorting records of the schizophrenic 
group had been scored in a previously reported study 
(16). The two brain-damaged samples were drawn in 
the following way: All cases referred for psychological 
observation over a number of years for whom sorting 
protocols were available were drawn from the file of a 
large hospital facility. They numbered about 100 
These test records were rescored to obtain the measures 
described above. The official diagnosis of each case 
was then obtained and the 53 cases with complete 


* For example, with the data of the present study, 
the product moment correlation between these two 
measures was 48 for the brain-damaged group and 

63 for the schizophrenics 
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rotocols who were subsequent found to be diagnosed 


us brain damaged were selected 

It was not practicabie to provide as tight a control 
ove the po hit f tematic Dias in scoring as 
Ait avaliable m the ‘ tuc 16 Cherefore, 
evera nformal check of scoring reliabilit were 
mace t ect that turned out not to be asso 
ciated th a diagno of brain damage were compared 
with those the selected sample; for each predictive 


mea he average of the brain-damaged Ss exceeded 
that of the unselected grou Second, of the Ss who 
were ¢ tua ine ed within the brain-damaged 
ample i few sabout 10) were known by the scorer to 
have a ug? | brain damage because of notations 
on the test record ibsequent comparison of scores 
for ti mall group against the rest of the sample 
i ited no tematic differences in the obtained 
core I} the group who were subsequently identi 
hie " iin-damaged with | chotic reaction” were 
foun »o differ considerabl fror the n jor brain 
damage ‘ nile in their obtained score these differ 
Y ( were a na direction consistent with the hypoth 
emu Fina n pre yu tudies it has been found that 
th rat! ireful instruction and practice on scoring 
proce re terscore igreement can be raised to 
sbout 93 ner cent his compare favorably with a 
core rex ¢ agreement of 95 per cent in the present 
t 180 response elected “blindly and at 
rar ! m the protocols of all groups, with a time 
inte i t about x month 
Hy potice r 
In the pre tudy of differences in conceptual 
izatior etween cl phrenic and nonpsychiatric 
wre 7 Icce ful predicted that the schizo 
phrenic group would show ilo of social communica 
tior f ‘publicne n the formation of concepts, 
wit) t ence of a corresponding loss in high order 
ol com tualizatior ) of ‘openness , 16 p +5 
Th } thes) grew it f a consideration of the 
tentative tr ng ! the authors l In an 
estigatior {f diflerence n the language behavior of 
norma t was found that a predominant tendency 
t © « ‘ oncepts in object sorting was associated 
wit! timulus-boundne or “reality-fixing’’; simi 
ur i ‘ minant tendency to use private concepts 
Dbject- sorting wa ociated with sutism.”” TI 
the | pothe vas based upon the assumption that 
the peculiar forms of schizophrenic conceptualization 
should be more closely associated with autism and 
rea than with realit fixing and stimulus 
boundn 16 | tf 
Im the present stuc the opposite assumption 1s 
made nceerning the characteristic conceptualization 
of brain-damaged patient viz., that it should be more 
close issociated with reality fixing and “stimulus 
houn 7) than with autism or loss of reality 
testing 
Ihe general hypothesis of differences between brain 
damaged and scl izopt renic groups in object-sorting 
behavior is that the brain-damaged group has a greater 
deficit higher order concepts (loss of “‘openness”’ 
The specific hypotheses stated in terms of the opera 
tior { the study, are as follows: In comparison with 
the act hrenic group, the brain-damaged group 
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1. Employs significantly fewer open and significantly 


more closed concepts 


2. Employs significantly fewer (a) open-public and 

b) open-private, and significantly more (c) closed 
public and (d) closed-private concepts 

3. Scores significantly lower on an autistic index 


consisting of open-private minus closed-public concepts 


RESULTS 


The results of an analysis of covariance for 
the con eptual area measures are shown in 
lable 2. The values shown in the “adjusted F 
ratio” column reflect only the residual vari- 
ances attributable to experimental group dif- 
ferences after the covariance between intelli- 
gence and the conceptual area measures has 
been withdrawn. The extent of this covariation 
for each measure for Ss in the two groups may 
be seen in the column of correlation coefficients. 

All six measures involved in the hypotheses 
indicate resu'ts consistent with the prediction 
of greater deficit in higher order concepts (loss 
of openness of conceptualization) for the brain- 
these group mean 
the .01 level or 
better.* The other two (open-public and closed- 


damaged group. Four of 


differences are significant at 


private) both show considerable mean differ- 
ences but also show relatively high correlation 
values with intelligence. The greatest F ratio 
is shown for the closed-open variable, which is 
the major predictive measure for the present 
study. No hypothesis was formulated for the 
public-private variable, which served as the 
the 
study; in 


major predictive measure in previous 


the 
present study, the mean difference between 


schizophrenic-nonpsychiatri¢ 


It can be seen that 
the predominant private conceptual area for 


groups is relatively slight 


the brain-damaged group is closed-private, as 
contrasted with the predominance of the open- 
private area in the conceptual behavior of the 
schizophrenic group. Apparent differences in 
meaning for these two kinds of private concep- 
tualization will be discussed more fully below. 

A comparison of 
obtained in the previous schizophrenic-non- 


these results with those 


psychiatric study (16) indicates that the ex- 
treme group separation in every con eptual 


* F ratios to test for homogeneity of variance were 
computed for all measures in Table 2 for which p values 
for F ratios are shown. The results of these tests indi- 
that 
variance is tenable in all cases except for the open 
the ratio for the closed 
private measure also approaches significance. 


cate the hypothesis of a common population 


private measure, although 
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TABLE 2 
SIGNIFICANCE OF DIFFERENCES IN CONCEPTUAL AREA 
Scores BETWEEN SCHIZOPHRENIC AND Brartn-DAm 
AGED GROUPS AND CORRELATION VALUES BETWEEN 
CONCEPTUAL AREA AND INTELLIGENCE TEST 
Scores ror TuHese Groups 


Schize Brain 
p oe meneaes Ad 
Measure oom p ’ 's 
r Ratic 
. - 
azia|a/e 
Public 11.00)4.70)11.27)3.¢ 0.82 42 01 
Closed 6.05)3.35|11.18|3.29] 25.34 |<.001|—.32]<.01 
Closed- Public 3.78)2.62) 6.77\3.46) 14.71 001 4 
Closed-Private 2.27)2.11) 4.41)2.93 1.78 06 46,< .01 
Open-Publ 7.22|3.98] 4.50|2.74] 2.43 48|<.01 
Open-Private $.73|3.95) 3.32/2.48) 10.11 1 1 
Autistic Index* 11.95|5.95| 6.5 13) 22.99 om 29) « 
* Open-private minus closed-public, plus 10 


area measure, except that of public-private, is 
between the schizophrenic and brain-damaged 
groups. In other words, both kinds of disorder 
produce departures from normal conceptual 
behavior, but apparently in opposite directions 
for at least most of the measures 


DISCUSSION 
Methodological Considerations 


A point relating to methodology concerns 
the effectiveness of cutting points for the 
separation of schizophrenic and brain-damaged 
groups. While the writers were not primarily 
concerned in this study with the matter of 
individual diagnosis, they do heartily concur 
with Yates (24) that such information is basi 
to this type of study. Using the sum of “‘closed”’ 
concepts as the major predictor of group dif 
ferences and the total group mean (between 
scores 8-9) as the most reliable cutting point, 
the number of correct predictions for the 
brain-damaged group was found to be 26 out of 
34 (77 per cent), and for the schizophrenics, 31 
out of 37 (84 per cent). On the whole, it would 
seem reasonable to conclude that this type of 
analysis will not be very serviceable in individ 
ual diagnosis until the present techniques can 
be altered to provide a more sensitive measure 
of group differences in conceptual behavior 


Theoretical Considerations 


In the writers’ cpinion, the Goldstein ap 


proach represented real progress in this much 


neglected research area. But, more recently, a 


number of investigators (e g., 3 +, 5, 6, 9, 10, 


12, 13, 15, 16, 17) have indicated that, if the 
venture is made to ‘“‘cut through” Goldstein’s 
dichotomous formulation of abstract and con- 
crete conceptual levels, the yield is an im- 
provement in prediction of group differences 
in conceptual behavior. The present investi 
gators have attempted to show that the use of 
two isolated variables, previously embedded 
within the congeries of meanings for abstract 
concrete, may yield significant quantitative 
distinctions in the conceptual behavior of clini 
cal and “normal” groups, which are not evident 
when the sorting task data are analyzed within 
the duality of an abstract-concrete formula- 
tion. Thus, in a previous study (16), it was 
found that the schizophrenics showed a relative 
loss (or lack) of publicness in their conceptual 
behavior but that, in terms of order of classifi- 
cation, they actually showed more openness of 
conceptualization than did the nonpsychiatric 
group. In the present study, on the other hand, 
the brain-damaged group showed an extreme 
lack of openness of conceptualization; and 
with this measure the greatest group separa 
tion was between the two deviant groups, 
brain-damaged and schizophreni 

The mean difference between the brain- 
damaged and schizophrenic groups on the 
public-private measure did not approach sig- 
nificance. This closeness of agreement is, in 
one sense, an artifact produced by canceling 
out opposite group tendencies on the separate 
conceptual area scores, open-private and 
closed-private; that is, in their private concep 
tualization, the schizophrenics are predom 
inantly open and the brain-damaged are pre- 
dominantly closed 

rhese differences in kinds of privateness are 
sufficiently pronounced to merit further con 
sideration. The judgment of the privateness of 
a concept is based upon a criterion of relative 
inability of an observer to specify the limits of 
the conceptual grouping on the basis of a 
clearly denoted principle. Considered another 
way, a concept is private to the extent that 
the outsider would be unsure in anticipating 
whether additional objects would be included 
in or excluded from the sorter’s conceptual 
grouping. While this criterion is applied in 
scoring both areas (open-private and closed 
private), the quality of privateness in the two 
categories differs markedly 

An open-private concept has the quality of a 
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“personal idiom” (5), The most frequent types 
are idiosyncratic forms of symbolism, overab- 
stractions, and those which show ellipses in the 
conceptual process. The several concepts which 
Cameron (3, 5) has introduced (e.g., overin- 
clusiveness, asyndesis, metonymic distortion) 
to characterize schizophrenic thought are fre- 
quently apt descriptive terms for open-private 
conceptualization in the sense 
that they also refer to a form of thinking 
which lacks communicability, reality-testing, 
etc., but which involves the use of a relatively 
(e.g., are 


They are so in 


high-order of abstraction “these 
different forms of energy’’). 
Schizophrenics, and others who employ an 
open-private form of thinking, frequently use 
some aspects of the objects in the sorting task 


simply to “trigger” a series of autistic associa- 


tions. They are relatively insensitive to exter- 
nal “feedback,” and they show a relative lack 
of self-monitoring, and lack of set toward 


communication. Little or none of this seems 
true of the thinking of those who employ 
Here, 
the quality of privateness in their conceptuali 
but 


predominantly closed-public concepts 


zation results not trom overabstraction, 
usually from underabstraction. 

Many of the same subcategories are used in 
scoring both closed-private and open-private 
areas. The basic principle that distinguishes 
between these scoring areas is that of lower or 
higher order of conceptualization. Closed-pri- 
the 


heterogeneous grouping of a number of ob 


are usually based upon 


vate concepts 
jee ts “concept ualized”’ by the sorter at separate 
species-levels.’ Open-private concepts are usu 
ally based upon a syne dochal, or other type of 
loose-representational, relationship perceived 
by the sorter among the objects within the 
conceptual grouping. Thus, in an example of 
the ‘“‘closed’”’ narrative (‘‘I could smoke this 
cigar after I finish my coffee’), each of the 
grouped objects (cigar, matches, spoon, sugar 
cubes) is appropriate in its present species form 
to the “principle” of belongingness expressed. 
On the other hand, in an example of the “‘open”’ 
narrative (“fellow might want to smoke after 
the 


In 


he eats his dinner and then go bowling’”’), 
sugar-cubes to food 
general; and, even more distantly, the attribute 


are used represent 


'The term “species” is defined as “a group of 
objects, each of which is ‘called,’ or usually identified, 
by the same name (for example, knives, pencils)” 
(15, p. 185 
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of “greenness” of the paper square is abstracted 
and used synecdochally to represent the ac- 
tivity of bowling (inquiry revealed that the 
missing conceptual link was “bowling on the 
green’’). 

The privateness of the “closed” narrative 
cited above lies in the failure of S to designate 
conceptual limits to his grouping; that is, it 
would be difficult for the outsider to predict 
whether or not each of a number of additional 
species-objects coming within the perceptual 
field of S would be included as “belonging”’ to 
the conceptual grouping, since no actual prin- 
ciple of collection is expressed. Thus, a closed- 
private concept is one in which there is virtu 
ally a complete absence of a collecting principle 
for a group of objects, considered by S to 
“belong together.” 

The virtual 
generic, principle (closed-private conceptuali- 


absence of a collecting, or 
zation) and the use of conceptual classifications 
of a relatively low order of abstraction (closed- 
public conceptualization) are characteristic of 
the of the brain-damaged 


group. We have previously (15, 16) associated 


sorting behavior 


the terms ‘ 
fixing’ with this form of conceptual behavior 
As distinguished from the concepts cited above 
to characterize schizophrenic thinking, these 
terms seem to us to bear a very close resem- 
blance to some of the criteria invoked by 


‘stimulus-boundness” and “‘reality- 


Goldstein, Storch, Stern, and others to charac- 
terize concrete thinking in general; examples 
of these are impairment in discrimination be 
tween figure and ground (7), restriction of the 
conceptual response to the “immediate experi 
ence of the given thing or situation in its 
particular uniqueness” (7, p. 18), and concep- 
tualization based upon a preponderance of 
concrete images (19). But it should be noted 
that criteria for behavior 
seem appropriate only for the specific kind of 
concreteness that is shown in a predominantly 
“closed”’ form of conceptualization. 

On the basis of the findings of this study, 
the following conclusions are drawn: 

1. The use of two isolated variables of con- 


’ 


these ‘concrete’ 


ceptualization appears to be a more fruitful 

approach than the use of the dichotomous 

abstract-concrete formulation in predicting 

qualitative group differences in conceptualiza- 

tion. 

2. The extreme differences in conceptual 
brain- 


behavior between schizophrenic and 
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damaged patients suggest that the two groups 
should not be represented as evidencing the 
same type of conceptual disorder. 


SUMMARY 


This was a study of differences in object- 
sorting behavior between schizophrenic and 
brain-damaged groups with respect to two 
conceptual variables—amount of social agree- 
ment and order of conceptual classification. 
It was hypothesized that the brain-damaged 
group shows a significantly lower order of 
conceptual classification than does the schizo- 
phrenic group. 

The variable, social agreement, was scored 
in terms of the relative publicness-privateness 
of each conceptual sorting. The second vari- 
able, order of classification, was scored in terms 
of the number of attributes used in the defini- 
tion. In addition to these two measures, five 
others were derived from the interaction of the 
two variables. 

An analysis of covariance (controlling for 
intelligence) indicated that all six measures for 
which hypotheses were formulated yielded 
results in the predicted direction and that four 
of these mean differences were significant at 
the .01 level, or better. The greatest F ratio 
was for the closed-open variable, in keeping 
with the major hypothesis. Differences on all 
of the measures, except public-private, were 
greatest between the schizophrenic and brain- 
damaged groups with the results of the nonpsy- 
chiatric group falling in between. 
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A COMPARISON BETWEEN IPSATIVE AND NORMATIVE 


RATINGS OF PERSONALITY! 


JACK BLOCK 


Per 


Institule of 


LTHOUGH the distinction has been noted 


for some years (2, 5, 6), it was Cattell 
(3) who first named and paid detailed 
attention to the difference between “ipsative”’ 


and “normative” forms of psychological meas 


In ipsative measurement, the set of 


urement 

cores (traits, characteristics, behavioral meas 
ure are weighted or ordered relative to that 
ind lual’s own personal mean. In normative 


measurement, each score for an individual is 
evaluated relative to the mean score of a group 
of individuals. An illustration of the ipsative 
method is the rank ordering a subject may pro 
vide of his esthetic preferences among a variety 
of alternatives. Here the individual has a per 


sonal norm or indifference point tn terms of 


which his choices may be evaluated. Norma 
tive measurement 1S exemplified by any test 
which employs group norms in terms ol which 


an individua achieves As 


Cattell remarks, the distinction has consider 


core meaning 


implication for 


able the propriety of various 


methods of statistical evaluation, for the Q vs 
k sy in factor analysis, and for di 
verse 

(one 


controve!l 
other issues 
ol 


normative 
yf personality are employed. For rea 


the 
important 


the contexts where ipsative 


is is where 


distinction 


ratings ¢ 
ons of convenience, necessity, and intrinsi 


personality evaluations by observers 


interest 
have been and will continue to be widely used 
re rch. Traditionally, personality ratings 
have been made normatively, e.g., 100 sub 
jects art rated as to their ‘aggressiveness.’ 
More recently, the ()-sort method of personal 
I'his research was supported in part by the United 
tates Air Force under Contract No. AF 18 (600)-8, 
monitored by Technical Director, Detachment #7 
Officer Education Research Laboratory Air Force 
Personnel and Training Research Center, Maxwell Air 
Force Base, Alabama. Permission is granted for repro 
duction, translation, publication, use, and disposal in 
whole and in part by or for the United States Govern 
ment 
Personal views or 0 ions expressed or implied in 
thi pul ication are not to be construed as necessarily 
carrying the official sanction of the Department of the 
Air Force or of the Air Research and Development 
( ommand 


WY 


onality Assessment and Research, University of California, Berkeley 


ity description, an ipsative procedure, has 
achieved some usage. 

rhere has been some question as to the rela 
tionship existing between the two measure- 
ment approaches. Cattell suggests that ipsa- 
tive treated normatively will 
provide results different from those derived 
by direct normative measurement (3, p. 296). 
More recently, Guilford has written, “Indi 
measurements 


measurement 


vidual differences in 
have little meaning because there is not a sin 


. (Ipsative scores 


Ipsative 


gle scale for all individuals . . 
should not be used for correlation of variables 
over a population of individuals” (4, p. 528) 

Clearly, there is good logical reason for sus 
pecting a lack of equivalence between the two 
measurement approaches. The fact that in the 
ipsative case the mean in terms of which traits 
are evaluated may vary from individual to in 
dividual may make possible bizarre and inap- 
propriate relationships 

However, the matter is open to empirical 
investigation as well as rational analysis. The 
present paper reports tt 
in which ipsative ratings, treated normatively, 


1e results of two studies 


were correlated with corresponding normative 
ratings, in a test of the functional equivalence 
of the two forms of measurement. The findings 
are, to the investigator at least, somewhat sur- 
prising. A number of current rating and assess- 
ment procedures perhaps bear revision in the 


light of these data. 


ANALYSIS | 


As part of an assessment program at the Institute of 
Personality Assessment and Research, 100 Air Force 
officers were observed period by 
staff of eight psychologists. At the end of assessment, 
each of the ten staff members rated each subject (8) on 


for a three-day a 


3O aspects of personality. Each rater used a 5-point 
rating scale and, in evaluating the 100 Ss, considered 
each rating variable singly. The ratings were afterward 
averaged to form for each S one score for each dimen 
These data served as the normative measures in 
comparison 
Eight of the ten staff members also described each 
The items 


s10n 
the 


subject by means of a 76-item Q-sort deck 
were evaluated vis-a-vis each other for relevance and 
in characterizing each S’s personality, and 
The 


salience 


sorted along a 9-point continuum scale scores for 
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rABLE 1 
CORRELATIONS BETWEEN NORMATIVE RATINGS AND EQUIVALENT Q-Sort ITems: First Stupy 






Reliability of normative and ipsative ratings are in parentheses) 






Corrected 






[psative Ipsative 
Normative Normative 
Correlation Correlation 











1. “Ability to communicate” (.90) vs. “Communicates ideas clearly and effec 88 1.05 
tiveiy” (.78 y 
2. “Adaptive flexibility” (.87) vs. “Is rigid; inflexible in thought and action” 78 94 
ov 
3. “Breadth of interests’”’ (.88) vs. “Has a narrow range of interests” (.91) 85 96 
; 4. *‘Constriction 83) vs. “Overcontrols his impulses; is inhibited; needlessly 82 96 
- delays or denies gratification’”’ (.88) 
d 5. “Dominance” (.90) vs. ““Takes an ascendant role in his relations with others’ 82 91 
YO 
6. “Evaluation of ideas” (.80) vs. “Has the ability to draw dependable and 73 90 


















































, practical inferences; common sense; good judgment i3 
7. “Fairmindedness” (.81) vs llows personal bias, spite or dogmatism to 10 93 
enter into his judgment of issues’’ (.69 
8. “Fluency of ideas 90) vs. “‘Is stereotyped and unoriginal in his approach S4 97 
, to problems” (.84) 
9. “Good judgment” (.88) vs. ‘Has the ability to draw dependable and prac 75 94 
tical inferences; common sense; good judgment”’ (.73) 
10. “‘Health and vitality” (.79) vs. “Is active and vigorous”’ (.84) 71 &S 
5 11. “Impulsivity” (.83) vs. ‘““Undercontrols his impulses; acts with insufficient 69 $i 
oO thinking and deliberation; unable te delay gratification” (.75 
12 Intellectual competence 90) vs. “Has a high degree of intellectual abil 6 1.00 
t ity 4? 
s Leadershi; 94) vs Is an ctive leader”’ (.80) 79 91 
’ Likeability 73) vs Is a able person (N.B. the subject’s general 66 KO 
acceptability rather than the rater’s personal reactions is intended $1) 
) 15. “Masculinity 86) vs. “‘Is masculine in his style and manner of behavior” i9 O4 
, 16. “Military & social presence”’ (.79) vs. “Lacks social poise and presence; be 79 1.01 
_ comes rattled and upset in secial situations 44 
t 17 Naturalness” (.74) vs. “‘Is naturai; free from pretense; unaffected” (.80) 73 95 
_ 18. “Originality” (.86) vs. “Is stereotyped and unoriginal in his approach to 87 1.02 
prot lems”’ (.84 
yy 19. “Positive valuation of the military identity” (.79) vs. “Identifies with values, 62 8&3 
e actions, attitudes, etc., of his professional group” (.72) 
P 20. “Rigidity” (.79) vs. “Is rigid; inflexible in thought and action’”’ (.80) 77 9] 
2 *Self-confidence’ 80) vs. ““Lacks confidence in his own ability ’ (82) x) yy 
5 22 self-defensiveness” (.71) v ] elf-defensive; rationalizes, excuses, 63 94 
r lames 64 
23. “Self-insight 70) vs. “Lacks insight into his own motives and behavior’ 66 1.02 
S- 61 
i 24 Verbal fluenc 90) vs Is verbally fluent; conversationally facile 83 4 oO 
each item were afterward summed across raters in This figure makes no allowance for the lack of exact 
of order to generate tor each 5 one score tor each Q item equivalence in certain ol the comparisons 
ce These data served as the ipsative measures in the 
: beta nh Anatysis II 
it It proved possible to match 24 of the 30 normative rhe second comparison employed data provided by 
q ratings with YQ items. Mate hing Was Gone ON an 4 priori a one-day assessment of 70 applicants to medical 
nt basis by two psychologists and the criterion for match school. A group of seven psychologists contributed 
ed ing was simply that the sensible meaning of the Q item normative ratings on 25 variables while a separate 
rd be considered to correspond fairly closely, if not exactly, group of seven psychologists Q sorted the Ss using a 
n to the definition of the normative ratings 108-item “clinical” Q deck. This deck was oriented 
in For each of the 24 pairings, the correlation between toward more highly inferential (“‘genotypical”’) features 
normative and ipsative scores was computed These of personality than the descriptive (“phenotypical” 
h are listed in Table 1, which also identifies the rating and items used in the first study, or the normatively rated 
n Q-item variables, the respective split-half reliabilities variables employed in both analyses. Because of the 
id and the ipsative-normative correlations corrected for greater complexity of these Q items, the matching of 
id attenuation. Over all comparisons, the median ipsative normative ratings with ipsative items proved to be 
normative correlation, corrected for attenuation, is .95 more difhicu!t and in certain instances rather arbitrary 
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TABLE 2 
CorRELATIONS BETWEEN NorMATive RATINGS AND EQuivaLent Q-Sort Items: Seconp Stupy 
(Reliabilities of normative and ipsative ratings are in parentheses) 


Corrected 


Ipsative Ipsative- 
Normative Normative 
Correlation Correlation 
1. “Breadth of interests” (.83) vs. “‘Has a wide range of interests” (.58) 74 1.07 
2. “Dominance” (.86) vs. ‘“‘Behaves in an assertive fashion” (.82) 53 63 
3. “Ego-strength” (.84) vs. “Has ittle ego-defense system; has a small 44 — .62 
margin of integration; precarious se!f-contro 61 
4 ‘in pulsivity”’ (75 vs His various needs tend toward relatively direct 64 .95 
and uncontrolled expression; unable to delay gratification; acts out”’ (.59) 
ntellectual competence 76) vs. “Has a high degree of intellectual compe 55 76 
tence oy 
6 trospectiveness” (.85) \ Is introspective and concerned with self as an 00 A, 
ject; frequently self-aware 71 
] Masculinity’ (.82) vs. “Is masculine in his style and manner of behavior” 65 86 
68 
a Veed for achievement” (.66) vs. “Has high aspiration level for self; am 31 57 
itious; wants to get ahead” (.45 
9 “Originality 69) vs. “Is original and imaginative” (N.B. quality of thought 61 . 89 
rather than quantity is to he emphasized) (.69 
10. “Responsibility” (.68) vs. “Is a dependable and responsible person” (.46 54 96 
1] elf-insight”’ (.82) vs. “Has insight into his own motives and behavior” 48 75 
Wy) 
12 il acuity 68) vs. “Is socially perceptive; responsive to a wide range 52 78 
al cue 65 
13 il presence” (.84) vs. “Has social poise and presence; appears socially 58 69 
aie a9 
14 Verbal fluency” (.72) vs. “Is a talkative individual” (.78) 61 81 
I cen matchings were deemed possible. Ipsative not so complete as in the first study, the ipsative 
normative correlations then were computed from scores normative correlations should show some decrease. 
derived n the first analysis. These are presented in Taking the two studies together, therefore, it would 
lable 2, along with the split-half reliabilities of the appear that there is an almost complete functional 
ipsative | normative measures, and the ipsative identity between the results obtained from ipsative 
normative correlations corrected for attenuation ratings treated normatively and conventionally ac 
Che findings from the second analysis are not quite so quired normative rating data. The one can be directly 
pectacular as in the first comparison. But the median substituted for the other, with insignificant loss of 
ipsative-normative correlation, corrected for attenua discriminations 
tion, is still .79. This is quite a respectable figure and is 
found ce ite the fact that the second analysis provides DISCUSSION 
a muct cter and perhaps unfair test of the equiva ; ° 
lence between ipsative and normative ratings. By Why is it that the ipsative-normative ratings 
using arate groups of raters in the second study, the appear to be so fully equivalent? One possibil 
possibility is eliminated of correlated rating error, a ity would seem to lie in the nature of the ipsa 


factor which from one point of view may have affected tive judgmental process. In ipsative ratings, 


the resu of the first analysis. But the use of separate , . 
the decision between qualitatively different 
rater ips introduces a source of variance which is - 


extraneous to the comparison of interest, i.e., the results Variables as to “‘salience” is made with refer- 


provid at the two methods. Because the correlation ence to a very large appert eptive body of in- 


between the two measurement approaches in the second formation and evaluation as to the psychology 


study reflects possible real differences between the two 


of human beings. This “adaptation level” is 


sets of raters as well as a possible real difference between 

the two methods, it is reasonable to find the coefhcient from the sets of reliability estimates of these ratings 

somewhat lower In the first study, ipsative and normative dimensions 
Another cause of attenuation is to be found in the required approximately the same level of rater infer 

poorer ipsative-normative matchings in the second ence and their reliabilities were about equal (means of 

study. The ipsative ratings in the second analysis were 80 and .84, respectively, computed via the Z trans 

more inferential and depended less on behavior than formation). But in the second study, ipsative ratings 


did the normative ratings.* Because the matchings were — were clearly more gt notypic in their emphasis. Here 
the ipsative reliabilities suffered where the normative 
* Suggestive evidence for order of complexity as reliabilities remained the same (means of .66 and .78 


affecting the ipsative-normative correlations comes respectively; computed by the Z method 


d 
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based upon some sum or integration of the 
rater’s lifetime experience with and concep 
tualization of various personalities. The frame 
of reference from which the ipsative judgment 
is made is therefore in actuality normatively 
derived, and it is not surprising (with after-the 
fact wisdom) that the considerable equivalence 
exists. 

But the normative reference base from which 
ipsative ratings are made is a much broader 
one than that employed for most normative 
ratings. In the case of ipsative ratings, the 
normative adaptation-level is a very stable one, 
accumulated over many years and not subject 
to radical change when a short run of dispro 
portionate personalities is encountered. Some 
thing like an absolute scaling of the rater’s im 
pressions is obtained 

In the case of directly normative ratings, the 
reference frame is usually a specifically de 
limited population; e.g., military officers, ap 
plicants to medical school, and so on. Because 
in an absolute sense populations may reliably 
differ with respect to their mean and range on 
various personality variables, relationships 
among these variables are likely to shift appre 
ciably as a function of the particular samples 
studied (1). A “‘masculine” graduate student 
may not be a “masculine” military officer when 
rated vis-A-vis this second population. But in 
the generalized normative frame in terms of 
which the ipsative rating is made, his rating for 
“masculinity”’ is likely to remain the same, re 
gardless of the particular sample of which he is 
a member. Ipsative ratings, normatively 
treated, thus offer the possibility of providing 
more stable interrelationships among variables 
than has usually been the case 

Which method, ipsative or normative, is to 
be preferred in research settings? The norma 
tive method clearly has a more immediate con 
venience since only one variable is dealt with 
at one time and the subject sample may be 
readily evaluated in terms of the particular one 
or two dimensions of interest. To achieve ipsa 
tive scores on but a few dimensions, al/ of the 
variables in the ipsative sample of traits must 
be evaluated for each subject, a task which may 
be quite formidable. It should be noted, 
though, that the normative approach loses its 
advantage of convenience as the number of 
variables of interest increases 


Although there are many contexts where the 


ipsative method is inefficient or inappropriate, 
there is one somewhat unusual benefit to be 
enjoyed from its use. Frequently, in research 
retrospect, e.g., in the summing up of longitu 
dina! studies, one wishes that an additional 
variable or conception of personality had been 
evaluate’l with reference to the now departed 
subject sample. With the normative method, 
there is no simple or easily justified way in 
which these belated evaluations may be gath 
ered. Memories prove faulty or the originally 
participating raters may no longer be available. 
With ipsative data, however, the desired rat 
ings are quickly achieved 

As an empirical illustration of how the omis 
sion may be rectified, consider the variable 
“Over-all Military Effectiveness in a Command 
Position,” normatively rated in the first study 
reported above. This variable could not be 
matched with any of the 76 ipsative Q items 
and was not employed in the main body of the 
analyses. Let us suppose that some years after 
the actual study, some evaluation of this di 
mension was desired for criterion purposes. An 
ipsative measure could be obtained in the fol 
lowing manner: 

1. Each of a number of staff psychologists 
would Q-sort their conception of the Air Force 
officer manifesting the greatest degree of 
“Over-all Military Effectiveness in a Com 
mand Position.”’ In actual fact, the average 
intercorrelation of five conceptual Q sorts was 
80. Only two of the five sorters of the cone ept 
had previously observed the officer sample and 
their sorts did not interrelate differently than 
did the remaining three 

2. A composite of this conceptual model 
would then be constructed by summing 
the Q-item placements across sorters. In the 
actual case, this composite has a reliability 
of .8Y 

3. The empirical Q sorts of each of the 100 
officers would then be correlated with the con 
ceptual composite 

4. These correlations with the composite 
could be treated as a set of scores ordering the 
subjects on the specified dimension. In the a 
tual illustration, when these correlations, con 
sidered as scores, were correlated with the 
initially obtained normative ratings of the di 
mension, the uncorrected correlation was .85 
Substantially the same degree of equivalence 


existed in several other tests of this technique 





JACK 


is it would appear that by use of concep 


tual © sort ind an vin © set which is 


comprehensive in nat can be derived 


from actual © sorts whi rtain to dimen 


sions not specifically ine uded in the () set or 


otherwise anti pated The normative method 


does not offer this useful facility in any con 


venient form, in part because normative rat 


ings are usually too few in number, but most 


particularly because normative ratings as usu 
ally gathered are too tied to the particular re 
ferent popu ation 


It is perhaps wise to close on a cautionary 


note The pre sent paper deals only with the 


comparison of ipsative and normative ratings 


The ipsative-normative distinction has rami 


fications much beyond those touched upon 


here. In particular, the issues involved in com 


paring ipsative and normative objective mea 


ures await empirical study 





SUMMARY 


Ipsative and normative ratings of personal- 


ity were compared in two separate studies 


joth analyses showed an almost complete 
equivalence between the two methods. A pos 
sible reason for this relationship was discussed 
and some of the advantages of the ipsative 
approac h presented. 
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THE EFFECTS OF GROUP EXPECTATIONS AND SELF-ESTEEM 
UPON SELF-EVALUATION' 


EZRA STOTLAND, STANLEY 


AND 


University 


ESEARCH on the level of aspiration has 
demonstrated quite well that a per 
son’s evaluation of his performance 

does not depend upon his absolute level of 
What counts is the level of 


relative to 


achievement alone 


achievement his aspiration his 


goal line. If the achievement lies on or above 


the goal line he will probably rate his per 


formance as good depending upon the size of 


this difference. These studies have been care 


fully reviewed and interpreted by Lewin ef al 


(5 

rhe placement of the goal line, whether it is 
different 
relerence 


influenced by 
kinds of 


mentioned often 


high or low, may be 


scales of reference. Two 


scales have been One is a 
source ol pressure irom other persons or groups 
to perform in a particular fashion on that 
activity. The other involves pressure from the 
person’s own expectations concerning his likely 
performance in that activity, arising from 
past In the 


examine the effect of a specific level of achieve 


experiences present study we 
ment upon an individual’s evaluations of his 
performance when the achievement is relative 
to a level of aspiration established by a group, 


and when the achievement is also relative to 
the 


himself as represented by his self-esteem 


member’s stabilized expectations about 


Group Expectations 


Ihe ability of a group to determine levels of 
individuals is discussed by 


different 


aspiration for 


Lewin ef al Severa ipproaches 


have been used in studying these events. In 


the 


study, 


method most close to the purpose 


a group’ level ol aspiration was 


scribed to the members and they then 


cepted this level as their own 


his approach suggests that we may con 


ceive of the level of aspiration aS a group 


1 The work on this study was financed by a grant-in 
aid |M-701 (( from the National Institute of Mental 
Health of the Nationa! Institutes of Health, Public 
Health Service, Alvin Zander, principal investigator 


! 
? Arthur R. Cohen is now at Yale University 
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f Michigan 


standard, which is the resultant of pressures to 
achieve that level exerted by the group upon 
the The 
ability to influence the beliefs or behavior of 


members power of a group is its 
its members. If the group has enough power to 
influence the members, and the group standard 
is high, the members will set high levels of 
themselves. If the standard is 


aspiration lor 


low, they will set low levels of personal 
aspiration (0) 
The power of a group to influence its mem 
different conditions. It is 
that a 
that 
Thus, the 


will be 


bers may vary under 
the work 


group’s power is greater 


evident in ol Festinger (3 
the 


group is attractive to its members 


more the 


members’ levels of aspiration more 
the the 


the more that the group is attractive 


closely modeled after 


group 


aspiration ol 


to them. If attraction to the group is constant 


and moderately high among all members, 
however, another matter determines whether 
they conform to their group’s level of aspira 
tion. This is the relevance of the activity 
Relevance refers to the degree to which an 


activity 1S important to the purposes ol, or 


for the maintenance of, a group. Schachter (7) 
has shown that the power of the group | likely 
to be stror ger upon persons engaging in an 
relevant to the group than upok per 
uctivity. It 
level set by a 
ta k, 


ires on the 


than 


activily 


on perlorming a nonrelevant 


hollow that the aspiration 


group concerning a relevant tends to be 
supported by stronger press 
that 


lor a 


person 
doe ' an 
task 


activity, 


to periorm level, 


aspiration level set 


up to 
nonrelevant 
Regardless of the relevance of an 


some persons May acs ept a group aspiration as 


We 


reasons 


their own more than do other individual! 
will not occupy ourselves here with the 
evel ol 


the 


lor variation in accepting the group’s 
aspiration. It is expected, however, that 


group’s level of would have more 


effect 


or a failure, the 


aspiration 


upon personal evaluation of a succe 


more the members accept the 
level of 


For 


aspiration as 


many there are 


og q 
{ roup 
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external criteria which help a member to 
judge how well he has performed an activity 
which is of interest to the group. An athlete 
knows whether he has won, and a salesman 
knov whether he has made the sale. The 
team’s, or staff’s, level of aspiration is not 


needed for matters such as these to help a 


person know if he has succeeded or failed, but 
the group’s aspiration is pertinent in influenc 

ing the extent to which the individual evaluates 
an objective failure as a failure and an ob 
success aS a assuming, of 


jective SUCCESS, 


course, he expects that the group will become 
aware of his performance. In the present study 
the subjects were made either to fail or to suc 
ceed, and had a clear knowledge of the level 


of their performance. The major issue is: 


what conditions in the group may influence a 
person’s evaluation of his performance when he 
knows that he has 

From the considerations thus far, two gen- 
are derived: (a) When a task 


is relevant to a group, the group’s level of as 


succeeded or failed? 
eral propositions 


piration will raise or lower a member’s evalua 


tion of his performance more, after an ex- 


perience of objective success or failure, than 
when the task is 
The group’s level of aspiration will raise or 


nonrelevant to the group. (d) 


lower a member’s evaluation of his perform- 
ance to the degree that he ace epts the group’s 
level of aspiration as his own 

suggested by these 


opecili hypotheses 


propositions, but more amenable to verifica- 
tion, are as follows 

1. A level of aspiration set by a group for 
its members is more often accepted by a mem 


ber as a persona! goal when the activity is 


relevant to the group than when it is non- 
relevant 

2. A failure on a relevant task generates 
poorer self-evaluation of a  person’s per 


formance if a group’s aspirations are high 
than 1 


failure on a nonrelevant task is not affected by 


they are low (while the evaluation of a 
different levels of the group’s aspirations 
2-a. A failure 


group’ level ol 


when a person accepts the 
aspiration as his own, gen- 
lower self-evaluation of his 


erat per- 


formance if the group’s aspirations are 


high than if they are low (while evaluation 
of a failure, when the group’s aspiration is 
affected by different 


of the group’s aspirations 


re je ted, is not levels 
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3. A success on a relevant task generates 
better self-evaluation of a person’s performance 
if a group’s aspirations are high than if these 
aspirations are low (while the evaluation of a 
success on a nonrelevant task is not affected 
by different levels of the group’s aspirations). 

3-a. A success when a person accepts the 
group’s level of aspiration as his own, gen- 
erates better self-evaluation of his per- 
formance if the group’s aspirations are high 
than if they are low (while the evaluation 
of a success when the group’s aspiration is 
rejected is not affected by different levels 
of the group’s aspirations). 


Self-Esteem 


When a person evaluates his level of achieve 
ment he refers not only to the expectations of 
other persons but to the concept he has of 
himself as well. The degree of an individual’s 
self-esteem, therefore, is expected to affect the 
way in which an individual 
performance in a particular situation. Self- 
esteem is a function of the degree of coincidence 
between an_ individual’s and actual 
concepts of himself. The more that the actual 


evaluates his 


idea! 


and ideal concepts coincide, the higher the 
self-esteem. 

Persons who are high in self-esteem, we as- 
sume, differ from those who are low in the way 
they react to an immediate experience which 
has been a test of their abilities. Cohen (1) 
has reported that persons with high self-esteem, 
as compared, to those with low, are char- 
acterized by tendencies to protect themselves 
from negative self-evaluation. It is not clear 
why this should occur. Perhaps persons with 
high self-esteem have learned to overlook 
their objective failures and to concentrate 
upon their successes in building their concept 
of themselves, or perhaps they have con- 
sistently performed in accordance with their 
ideals and hence are less threatened by an ex- 
perience of failure. In any event, it would be 
expected that persons with high self-esteem 
would be able to evaluate an objective failure 
as a small failure and an objective success as a 
large success. Persons with low self-esteem, 
however, because they do not protect them- 
selves from negative are more 
likely to evaluate an objective failure as a very 
poor performance and a success as a small 


evaluation, 


SUCCESS. 








Group EXPeEcTATIONS, SELF-ESTEEM, AND SELF-EVALUATION 


When the person is a member of a’ group 


which holds certain standards for his 


formance, how does the level of his self-esteem 


per 


affect his evaluation of his achievement? 

A number of different answers to this ques 
the 
alternatives cannot be made with confidence, 


tion appear plausible. A choice among 
however, because there is no way of anticipat 
ing whether the potency of a scale of reference 
provided by the power of the group will equal 
that offered by a person’s seli-esteem; or 
whether the weight of the group and individual 
influences will differ under certain conditions, 
but not under others. Speculation about these 
matters is postponed until it is clear how the 
group standards and self-esteem interact in 
determining the way a person evaluates his 
level of achievement. 


METHOD 


An opportunity to examine these phenomena was 
created in a laboratory experiment in which the effects 
of both group pressures and self-esteem upon self 
evaluation group 
aspiration were created, termed high and low expecta 
At the same time, the relevance of the 


could be observed. Two levels of 
tions hereafter 
activity to the group’s purpose was varied by describing 
the S’s task as essential to the group goal in one set of 
conditions and nonrelevant to the group goal in the 
other. Thus, there were four conditions consisting of 
high or low expectations in combination with relevant 
or nonrelevant tasks. Finally, within each of these con 
ditions, half of the Ss were made to fail on their tasks 
and half to 
measured in a questionnaire administered after the Ss 
had worked on the tasks but while they were ass 
variables 


succeed. The dependent variables were 


uming 
that the experiment was not yet finished. The 
were the Ss’ evaluations of their performances on the 
task and their concern with the group’s expectations 


(i.e., acceptance of the group’s level of expectations) 
All were measured on 6-point graphic rating scales. In 


addition, a measure of self-esteem was administered 


before the experiment was started, and again as part of 
the second questionnaire 


Measurement of self-esteem. The measure of self 
esteem was designed in accord with the definition of 
used | 


the concept and modeled after an instrument 
Cohen (1 
whic h he 


y 


A person’s score describes the degree to 


meets his needs in ways he considers to be 


ideal 
The items of the test area series of brief paragraphs 
each describing a hypothetical sitvation in which a 


person faces a potential frustration oj a need. Problems 


of meeting needs in hive repre sentative areas are in 


cluded 
and autonomy 


recognition, affilatior mgnition 


the S 


achieveme nt, 
After reading each paragraph 
indicates how choosing from 
among five 
the reactions most and least descriptive of the way a 


a person would react 


behavioral alternatives. He first chooses 


person should ideally behave. Then, after a second reas 


wn 
~ 


ing of each paragraph, he chooses the reactions most 
and /east descriptive of the way a person actually would 
behave 
Phere follows an example of one of the hypothetical 
situations and the alternative responses for it 
A person has always valued his sense of freedom 


and independence. Because of what others think and 


say, this sense of freedom and independence is 
greatly hampered 
Most Least 
a. Reason that maybe a great many 
mutual responsibilities with others are 
most important after all . . 
6. Figure that while things are con 
fining now, knowing where one stands in 
relation to others is half the battle . . 
c. Tend to turn his thoughts to more 
pleasant aspects of his dealings with 
others . . 
d. Face the fact that other people are 
over-eager to maintain their opinions and 
ideas in the face of his . . 
e. Think twice before overly involv 
ing himself with others because these 
experiences are uncomfortable for him . . 


The responses are scored by giving a +1 for each 
occasion where the actual most preferred reaction is the 
same as the ideal most preferred reaction, and where 
the actual least preferred is the same as the least pre 
ferred. A score of 1 is given for a coincidence of the 
actual most preferred with the ideal least preferred, 
and of the actual least preferred reaction with the ideal 
most preferred. If no coincidence occurs, no score is 
given 


all items 


I'he person’s score is obtained by summing across 


In previous work with a longer version of this meas 
ure on a sample of sixty persons, the odd-even split-half 
was .M4 9] 


Spearman-Brown formula 


correlation when corrected by the 
In order to measure self 
esteem in the laboratory, however, the requirements 
made it necessary to use only one-third of the hypo 
thetica! situations of the original instrument. Scores on 


all the items were significantly related to total scores 


in the larger instrument 
The sequential phases of the experiment are now 
dex ribed 
Group formation on a cooperative task. The 


of the 


purposes 


opening part of the experiment were, first, to 


form a basis for the subsequent manipulation of rele 


vance and, second, to make membership in the group 


salient for the Ss. Making membership salient, it was 


assumed, made attraction to the group high so that 


group pressures were acceptable to the members. These 


} 


purposes were accomplished by having the group work 


and succeed on a cooperative task, in competition with 


other, fictitious groups 


The Ss were 175 male undergraduates at the Uni 


versity of Michigan who volunteered to participate in 





the study. They came to the laboratory four, five, or six 
at each session, and care was used to ensure that they 
were strangers to one another. As they came to the 
laboratory, they were taken to individual! cubicles to 
fill out the self-esteem questionnaire. When all of them 
had finished, they were eated in a circle to receive 
instructions, and were told that they would be king 
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pleting his individual task, and that we expect the 


others to do the same 


The two expectations conditior were created by 


giving the contrived tabulations favoring high or 
low levels 

Achievement level. In the success condition, the task 
was easy enough to allow 95 per cent of the Ss to com 


plete it in the allotted time. In the failure condition, the 


Ss were given puzzles which were impossible to complete 


because no piece met the criterion of a successful solu 


tion. Blank jigsaw puzzles (i.e., no pictures on the faces 


ol the pieces) were used in order to make it uncertain as 


to how close they had come to the correct way of 


assembling the parts if they did not complete it. Thus 


a failure could or be an objective failure in fact, and 


no gradations of partial success were possible 
Afrer the Ss 


were again given a question 


Ouestionnatré were through working on 


the individual! task, tl 


naire with the understanding that they would do the 
second group letter after the form was finished. When 
the questionnaire was told that 
there was no second group task and the experiment was 


The purpose of the study was then explained, and 


completed, they were 


over 


all questions answe red 


REsuLTs? 
The 


members 


self-evaluation 
that 


Group expectations and 
first hypothesis, which asserts 
will accept the level of the group’s expecta- 
tions as a personal goal line more often when 
the task is than 
vant, was examining the degree to 
which the Ss 
about the team’s expectations while they had 


relevant when it is nonrele- 
tested by 
said they were “concerned” 
been working on their individual tasks. The 
mean concern in the relevant condition is 4.00, 
which is significantly greater than the mean of 
3.27 in the nonrelevant condition (p 01, by 
t test). The data support the hypothesis. 

Ihe second hypothesis states that a person 
who fails on a task to the 


group will evaluate his performance lower, if 


which 1s relevant 


the group’s expectations are high than if the 
Table 1 it 


seen that in the relevant condition the 


group’s expectations are low. In 
may be 
mean self-evaluation is lower (2.05) when the 
group expectations are high, than when the 
The difference 
between these means-is significant by a one- 
tailed ¢ test at the .05 When the task 


was nonrelevant, and the members failed, the 


expectations are low 2.65 


level 


* A comparison of any two means which Is not spe 


cifically mentioned in the text may be assumed by the 


reader to be not significant at the .05 level or better, 


except for the contrasts between means of self-evalua 


tion in succe conditions with those failure condi 


} } 


tions, all of which are significant 
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level of the group’s expectations creates no 


significant difference between the mean evalua 


tions of performance (2.00 and 1.90). The 


data support the second hypothesis 


Hypothesis 2-a states that when persons 


fail on an activity, they will evaluate their 
achievement as poorer when the group’s ex 
pectations are high, and will evaluate their 


performance as better when the group exper 
tations are low. This effect will be more marked 
for persons with high concern than for those 
with little concern about the group’s expec 
tation 

In Table 2 the 


those Ss with high concern for the group’s 
g g | 


mean self-evaluation for 


expectations is significantly lower, where the 


group’s expectations are high, for a relevant 


task which is failed (1.83 than where the 
group's expectations are low (2.83 This 
difference is significant by a / test at .05. The 

who were little concerned about the 


persons 
group’s expectations do not differ significantly 
in their evaluation of their failure experience 
according to whether the group’s expectations 
were high or low (2.36 and 2.20 respectively 


When the task 


degree of concern about the group’s exper ta 


was nonrelevant, the members’ 


tions does not generate a significant difference 
in the way that they failure. It 
appears that the group’s level of expectations 
had its 
failure 


evaluate a 


trongest effect upon self-ratings of 
the task and the 
highly the 
support 


when was relevant 


concerned about 


These 


members were 


group’s results 
Hypothesis 2-a 


Che third hypothesis proposes that a person 


exper tations 


who succeeds on a relevant task will evaluate 
his performance lower when group expectations 


are low, and higher when group expectations 


are high. No significant differences are found 

Table 1) in « omparing the means of those who 
rABLE 1 

M ZAN SELF-EVALUATION OF SUCCESS ON TASK FOR ALI 
(ONDITION 


N’s in parentheses 


Nonrelevant task 


_— High group | Low group | High group | Low group 
ectations expectations expectations expectations 

Failure 2.05 (25) 2.65 (21)|2.00 (20) 1.90 (25 
Success 4.80 (22)|4.59 (22))4.95 (20) 4.80 (20 





. 
AND SELF-EVALUATION 59 
rABLE 2 
MEAN SELF-EVALUATION OF SuccESS ON TASK FOR ALI 
CONDITIONS, COMPARING PERSONS wiTH HIGH AND 


Low ACCEPTANCE OF Grovup’s EXPECTATIONS 


N’s in parentheses 
Relevant task Nonrelevant task 
Achievement 


Low group High group | Low group 
expectations expectations expectations 


High grou 
expectations 


\. Higl acceptance of group’s expectations 


Failur 1.83 (12)2.83 (12)|1.80 (10)|\2.40 (5 
Success 4.73 (15)'4.69 (13) 5.09 (11) 5.00 (9) 
B. Low acceptance of group’s expectations 
Failure 2.36 (11) 2.20 (10)|2.60 (10) 1.69 (16) 


Success $50 (10) 4.60 (10) 4.78 (9 4.69 (16 


succeeded, regardless of the group's level of 


aspiration or the relevance of the task. Thus, 
the third 


Hypothesis 3-a it was 


hypothesis is not supported In 


predicted that a self 


rating ol success alter a success expenence 
would be a direct function of the group’s level 
of expectations, more so for persons with high 


The 


anticipated results did not occur; thus Hy 


concern than for those with little concern 


pothesis 3-a is not supported 
Self-esteem and self-evaluation. We examine 
now the effect of an individual’s self-esteem 


upon his evaluation of his achievement 
Self-esteem was considered to be an enduring 
personality characteristic which the Ss brought 
with them to the laboratory. It was assumed 
that it would not be changed by the experience 
there. The reasonableness of this assumption 
is shown by a comparison of the mean self 
esteem scores obtained prior to the experiment 
the final 


differences ap 


with scores obtained at the time of 


questionnaire. No significant 


peared between any of the experimental 


conditions. Thus, the events in the experiment 


did not change the participant’s self-esteem to 


a Significant degree 


Persons with high self-esteem have little 


more positive evaluation of their 


than 


other matter 


own per 


formance those with low self-esteem, 


when all are equal. The mean 
with high self 


with 


made by 

esteem is 3.95 and the 

low self-esteem is 3.88 
Members who failed on the task 


teem evaluated 


self-ratings person 


mean for those 
however, 


who were high in self-e their 
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rABLE 3 
MEAN Se_r-EVALUATION OF Success ON TASK FOR ALL 
CONDITION COMPARING Persons witu HicH 
AND Low Se.r-EstTeem 
V's in parenthese 
Relevant task Nonrelevant task 
Achievement 
reves High group Low grou; High group | Low group 


expectations expectations expectations expectations 


A. Persons with high self-esteem 


Failure 2.36 (8 2.€) (15).2.67 (12)'2.45 (11) 
SUCCESS 4.70 (10)|4.42 (12)\4.82 (11)|5.25 (8) 
BK. Persons with low self-esteem 
Failure 1.86 (14) 2.43 (7 1.50 (8 1.20 (10) 
Success 4.4/ 15) 4.78 (9) |5.11 (9 4.59 (17) 


performance higher than those with low self- 
esteem, all other variables constant. Tie means 
are 2.55 and 1.75 respectively, and are sig- 
nificantly different at the .05 level. Among the 
persons who succeeded, those who were high in 
self-esteem were not at all different from per- 
sons low in self-esteem in the self-ratings they 
assigned to their achievement (4.75 and 4.83 
respectively). The 
self-esteem, then, affected evaluation of per- 
failed, but not 


amount of the member’s 


formance when the person 
when he suct eeded 
the members 


It was apparent earlier that 


were more likely to evaluate a failure in ac- 
cordance with the group’s level of expectations 
when the task was relevant than when it was 
nonrelevant. It is striking, now, to observe 
that self-esteem appears to have served as a 
scheme of reference for the members more in 
the nonrelevant condition than in the relevant 
one. In Table 3 it may be seen that where the 
expectations of the group are high and the 


person fails on a task which is nonrelevant, 


those with high self-esteem have higher 
evaluations (2.67), than the persons with low 
self-esteem (1.50 The difference between 


these level. 
Where the expectations of the group are low 


and the persons fail on a task which is non- 


means is significant at the .05 


relevant, those with high self-esteem again 
2.45) than the 
(1.20). The 
difference between these means is significant 
at the .01 level. When the Ss failed on a rele- 
vant task, no significant differences were found 


have higher self-evaluation 


individuals with low self-esteem 


in the self-ratings of those with high self-esteem 


E. THomas, A. R. COHEN, 
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compared to those with low self-esteem. It 
appears that the members were more likely to 
evaluate their performance according to their 
levels of self-esteem when the power of the 
group was weakest (i.e., the task non- 
relevant), but less likely to do so when the 
power was strongest (i.e., the task was rele- 
vant). This statement is true, however, only 
when the members failed. Among those who 
succeeded on the task, with high 
self-esteem had no different feelings of success 
than those with low self-esteem when level of 
expectation was held constant, regardless of 


was 


persons 


the relevance of the task. 

In the opening discussion it was anticipated 
that persons with high self-esteem would 
better protect themselves against poor evalua- 
tions than those with low self-esteem. We have 
noted that persons with high self-esteem com- 
pared to ones with low self-esteem, evaluate 
their performances more favorably when they 
have failed. We have also noted that when self- 
esteem alone is varied there is no difference in 
the evaluation of their performance. These 
last data suggest that persons with high self- 
esteem do not always rate their performance 
more highly than persons with low self-esteem. 
Can we say, then, that persons with high self- 
esteem appear to protect themselves? Do they 
do this regardless of the group’s expectations? 
The data in Table 4 are pertinent to these 
questions. They indicate that persons with 
high self-esteem, compared to individuals with 
low self-esteem, are cautious about becoming 
concerned with the group’s expectations. 

The degree of concern felt for the group’s 


TABLE 4 


MEAN CONCERN witH Group EXPECTATIONS FoR ALI 
CONDITIONS, COMPARING PERSONS witH HIGH 
AND Low Se.r-EsTreem 


(N’s in Parentheses) 


Relevant task Nonrelevant task 


Achievement 
level High group | Low group | High group Low group 
expectations expectations expectations expectations 


A. Persons with high self-esteem 


Failure 4.25 (8) 4.20 5) 2.83 (12)|2.91 (11) 
Success 3.40 (10) 3.72 3.18 (11 (9) 
B. Persons with low self-esteem 
Failure 3.86 (14) 3.69 (7) 4.63 (8) 2.70 (10) 
Success 4.67 (15) 3.63 (8) 3.67 (9) 2.94 (17) 
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expectations differed for persons with high 
self-esteem compared to those with low es- 
teem, on the nonrelevant tasks. As may be 
seen in Table 4, persons with high self-esteem 
stated that they had little concern for the 
group’s expectations when they had failed and 
the group’s expectations were high (2.83). 
Those with low self-esteem, on the other hand, 
when they had failed under high group expec- 
tations, expressed high concern for the group’s 
expectations (4.63). The difference between 
these means is significant at the .01 level. In 
addition, participants with high self-esteem 
said they were most concerned about the 
group’s expectations when they had succeeded 
and the level of the group’s expectations was 
low (4.11). While Ss with low self-esteem were 
little concerned with the group’s expectations 
when they had succeeded and the group’s 
expectations were low (2.94). The difference 
between these means is significant, however, 
at only the .10 level. 

It appears that Ss with high self-esteem 
became interested in the group’s expectations 
when the expectation levels were easy to 
achieve and in fact had been reached, and 
less interested in the group’s expectations 


were 
when these expectations were difficult to reach 
and they had failed to perform up to that level. 
Persons with low self-esteem were in contrast: 
they became attentive to the group’s wishes 
when these were difficult to gain and the person 
had failed, while they were less attentive to 
the expectations when the level was easily 
available and they had already succeeded. The 
results suggest that persons with high self- 
esteem better protect themselves against un- 
favorable evaluation by avoiding involvement 
in the group’s expectations when an unfavor- 
able comparison with others would be likely 
to occur 

In the relevant situation the member’s self- 
esteem was not related to his degree of concern 
over the group’s expectations to a significant 
degree 

DISCUSSION 


It is to be remembered that the Ss had the 
criterion for a successful performance clearly 
in mind. They knew whether they had su 
ceeded or failed on their individual tasks at 
the time they were asked to stop. With this 
in mind we may wonder why the Ss should pay 
attention to the expectations of their group 


Festinger (4) has offered a proposition which 
points to the salience of this question: “‘When 
an objective, nonsocial basis for evaluation of 
one’s ability or opinion is readily available, 
persons will not evaluate their opinions or 
abilities by comparison with others.” It is 
clear from the context of this quotation that 
Festinger means to include conformity to 
group pressures, as well as simple comparison 
of own performance with the achievement of 
others. 

The results suggest that persons who were 
successful did not tend to evaluate their per- 
formance in the light of the wishes of the 
group, but that those who failed were affected 
by the group’s expectations. Successful persons 
clearly evaluated their performance as highly 
successful regardless of whether the task was 
relevant or nonrelevant, the group’s expecta- 
tions were high or low, or the group’s expec- 
tations were an important concern to them or 
not. Failing persons, however, evaluated their 
performance low if the group had high expec- 
tations, and they evaluated their performance 
high if the group had low expectations. The 
effect of the level of expectations upon the 
evaluation of their performance was also 
greater the more the persons were concerned 
about the expectations of the group. These 
results occurred when the task was relevant, 
but did not occur when the task was non- 
relevant. 

Why did the wishes of the group determine 
self-evaluations for persons who failed more 
than for ones who succeeded? A failure ex- 
perience, we believe, has an important property 
which is not present in a success. The person 
who has failed cannot be sure how the group 
will receive his report that he has failed. An 
audience for a report of failure, especially 
when it is an attractive group, is likely to 
create some embarrassment for the reporter. 
Uncertainty concerning the consequences of a 
report of failure makes the person who has 
failed highly sensitive to the opinions and 
evaluations of the others. Thus, social reality is 
more important to a person who fails than toa 
person who succeeds. The more the relevance of 
the task, the more does the person who has 
failed feei impelled to evaluate himself in ac- 
cordance with the group’s expectations. The 
person who has succeeded, in contrast, can be 


quite sure that his report of success will be 
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ved regardless of the group’s 


He does the 


and is quite ready to 


favorably rece 


level of expectation not need 


group for social reality 
evaluate a success as a success regardless of 
These results 

that 


basis for self 


the group’s wishes in the matter 


and their interpretation suggest even 


though an objective nonsocial 
evaluation is available, a person may evaluate 
himself by comparison with others, if he has 


a task 


note worthy 


which is relevant to the group. 
It i that 
vance of the task, exclusive of the group’s 
did affect the 
that a person placed upon a failure experience 
that 
ng a group down” on a relevant task is no 


failed on 


reievance or non- 


re 


expectation not evaluation 


These result indicate, we believe, 


ett 


more conducive to poor self-evaluation than 


failure on a nonrelevant activity. The evalua 


tion a person assigns to his performance is 


primarily affected by the group’s level of 
task is relevant 

the effect of self 
everal interesting issues. Why, 
did self-esteem affect the 


achievement more when 


expectalior where the 

The results concerning 
esteem sugyesti 
lor example mem 
ber’s evaluation ol his 


he had failed on a nonrelevant task than when 
he had failed on a relevant one? If it is assumed, 
as we have done, that group pressures are less 
potent in the nonrelevant condition, it is 
probable that the influence of self-esteem upon 
personal evaluation was relatively stronger 
than the power of the group in that condition 
both be 
the 


and because a failure experience made 


In the relevant condition, however 


cause relevance increase the power ol 


grou] 
a scale ol reterence, 
the 


the g oup more potent as 


elf-esteem was relatively weaker than 


a determinant of the self-evaluation. 


lief ha 


group a 
Phi " 


irom 


been supported by the findings 
yet unpublished (1, 8 
the be 


Vas predi ted be ause of 


evera! studies, as 


tigation variation in 
which 


variation in their personality 


In these inve 


havior of S 


characteristics, 
occurred more often when influences from the 
group were weak than when influences fron 
the group were strong 

The results indicate that persons with high 


self-esteem, as we have conceived of it and 


measured it. lealt with their experiences in the 
laboratory in a way that helped them to main- 


high 


themselves better after 


tain their sell-esteem They tended to 


evaluate a fasture than 
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individuals with low self-esteem, regardless of 
the level of the group’s expectations. Further- 
more, they became more concerned about the 
group’s expectations when favorable evalua- 
tion was readily possible for them. Persons 
with low self-esteem reacted to their ex- 
periences in a way that made it difficult to 
improve their rhey only 


evaluated a failure very low, but they became 


self-esteem not 
more concerned with the group’s expectations 


when an unfavorable evaluation most 
likely for them. Persons with high self-esteem, 
in brief, reacted to their experiences in the 


group in such a way that they best protected 


was 


themselves from unfavorable evaluation 


SUMMARY 


In a laboratory experiment, members were 
made to believe that their group had either 
high or low expectations for the quality of 
tasks. The 


relevant or 


their achievement on individual 
tasks 
nonrelevant to the purposes of the group, thus 
varying the the the 


member’s activity. Half of the Ss within each 


were described as either 


power of group over 
of these four conditions were allowed to suc- 
to fail. In 


addition, measures of self-esteem were avail- 


ceed and the other half were made 


able for all participants. 
The following hypotheses were supported. 
1. A level of aspiration set by a group for its 
members is more often accepted by a member 
as a personal goal when the activity is relevant 
to the group than when it is nonrelevant. 
2. A task 


poorer self-evaluation of a person’s perform- 


failure on a relevant generates 
ance if a group’s aspirations are high than if 
they are low (while the evaluation of a failure 
on a nonrelevant task is not affected by differ- 
ent levels of the group’s 


2-a 


aspirations) 

A failure, when the person accepts the 
group’s level of aspiration as his own, gen- 
erates lower evaluation of his performance if 
the group’s aspirations are high than if they 
are low (while evaluation of failure when the 
group’s aspiration is rejected is not affected by 
different levels of the group’s aspirations) 

the 


level of the group’s aspirations upon the eval- 


Hypotheses concerning the effect of 


uation of a successful performance were not 


supported It was proposed that a person 1S 


more likely to evaluate a failure than a success 








of 
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in reference to the group’s expectations; be 
cause the failure makes him more dependent 
upon the group for social reality concerning 
evaluation, than does success 

Persons high in self-esteem protected them 
selves against unfavorable evaluations better 
than those low in self-esteem. 

The group’s expectations appear to have 
been more potent as a scale of reference than 
the individual’s self-csteem in determining his 
evaluation of his performance. When the 
influence of the group was weakest (task was 
nonrelevant) persons high in_ self-esteem, 
compared to those who were low, differed in 
the way they evaluated their performance 
When the influence of the group was strongest 
(task was relevant) there was no difference in 
the way that persons high or low in self- 
esteem rated their achievement. 


1 


5 
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AGE DIFFERENCES IN VERBAL LEARNING! 


SHELDON J. KORCHIN 

Institute for Psychosomatic and Psychiatric 

HE aphorism that “you can’t teach an 

old dog new tricks” undoubtedly pre 

dates psychological research both in the 

field of aging and of learning. However, the 
pecitic conditions which determine the rela 
ive ease or difficulty of learning for the older 
perso! the importance of different types of 


material, learning situations and the like, have 


been a concern of systematic and experimental! 
work within the psychology of learning only in 
relative recent years 

Of considerable interest is a 1934 study by 


which h become a 


issions of age-related 


Ruch (13 


relerence in 


is since 


disci 


changes in human learning. With the view that 
enility can be conceived as a condition of 
lowered hiologica pla licity, Ruch hypothe 
sized that der compared to younger persons 
hould show a greater deficit in any learning 
situate hich re julres the reorganization ol 
existing habits, less deficit where earlier exper! 
ence in be utilized in new learning, and have 
an intermediate amount of deficit where old 


iil | nor block new learning 


habits neither 


In his experiments Ruch (13) had three age 
rr earn a pursuit rotor task, first in 
dire en ) mirror vision, and three 
lists of paired-associate materials These lists 
differs n the degree to which they utilized or 


contradicted earher learning: (a paired associ 
il¢ lam I V¢ rd logically related: b 
nonse¢ materia irbitrary equations of the 
form E X Z G ind interference ma 
ter e equations ol the form 35 X 5 0p 
| ’ , it liffe ' 
| est iSk ere presumec oO aiter not only 
mn dif | ind nove vy but also in the degree 
to which prior experience might enter to facili 
tate or rupt present learning. In the word 
1SS¢ " isk earlier learning should facilitate 
I i ipport e I Ay 
ur ( t No. DA-49-007-M]D-261 tl izh the 
Me i esea i Devel ent Bo Ott f 
the , General, Departme ft \ Wasl 
ing lL). ©. We are grate Mr. Ben Gross 
Dire« he f the .D Kt Hi ‘ r thet 
com] ol i ! rv t i i 
tie = Albert Car ul Mrs. A Crit he er 
re ! the ita collect i Dr 
Helen Heat] the stistica a 
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the learning of these simple associates; the non- 
sense equations required the learning of essen- 
tially new material which should be unaffected 
by earlier information; while in the case of the 
false equations the material was not only new 
but contradictory to previously-formed fa- 
miliar With material 
might expect competing responses, the highly 
overlearned true products of simple multiplica- 


associations such one 


tions, to interfere with the acquisition of the 
new habit 

Comparing the differences in mean scores 
between age groups on each of the procedures, 
Ruch (13) found that the young-old difference 
was greater for the mirror than direct vision 
perceptual-motor task and, among the verbal 


tests, was greatest for the interference ma 
terial, less for the nonsense task and least for 
the familiar word associates. On this basis 


Ruch concludes that older persons have least 
deficit” in learning materials which are com- 
patible with habitual behavior and greatest 
the earlier 


habits. However, if one compares the scores for 


where new learning contradicts 
each procedure within each group separately, 
it is interesting to note that all groups per- 
formed better on the interference material than 
on the nonsense task, although the difference 


between young and old } greater on the former 
than on the latter 

As part of a larger study of psychological and 
neurophysiological processes in aging, Ruch’s 
three verbal learning procedures were used in 
+} 


rhe findings of other studies 
2, 10 


Che present paper will consider only the com- 


e present study 


in this series are reported elsewhere ( 


parison of two groups, distinct in age, in their 


pe rflormance on these proc edures. The purpose 


is twofold: (a) To repeat Ruch’s study and to 
retest his hypothesis; and (6) To see whether 
further understanding can be gained about 
age-related changes in cognitive functioning 


generally, and as they may account for per 
formance in these learning tasks 

METHOD 
Subjects 


the 


between 


Iwo di ye groups wert main 
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the ages of 22 and 33 years of age, and included 12 


resident physicians with a mean age of 28.2 years and 


nurses with a mean age of 26.3 years. The 


12 graduate 


dd group consisted of 36 residents of an institution for 
the aged, evenly divided by sex within three ag 
groups: 12 Ss between 65-74; 12 between 75-84; and 
12 above the age of 85 The Ss ere inciuded 1 the 
older sample only u the were in good general healt! 
free from organic brain pathology, and had acceptabl 
corrected vision 

However, the total young group was distinctly 
superior to the old in education and intellectual level! 
Since one n ight reasonably expect inte gence, in 
general, to be related to performance on these tasks 
and, in fact, Ruch (13) had found this to be tru 


particularly within his older groups, it was felt that a 


more direct and exact test of the differences in learning 


1 
I eral intelli 


Lj if 
could be 


made using groups equated for get 


gence nce vocabulary correlate v¢ with general 
intelligence as measured by omnibus batteries and 
itself shows relatively little change with age, the Vo 


Wechsler-Bellevuc 
I'wo matched groups of 16 Ss each were 


cabulary test of the Scale was used 
(1, 4, 8, 15 
drawn from the larger sample. The young group had a 
vocabulary score of 26.83 with a 


of 4.6; the old group 28.6 and 4.8, 


mean unweighted 


standard deviatior 
isted of 5 men and 


respectively. The young group cons 


11 women, with a mean age of 26.8; the old group of 
8 men and 8 women, with a mean age of 78.1 years 


rhe results of this experiment will be reported for 
these 


same analyses have 


selected groups matched in intelligence The 
been carried out, however, for the 
total groups and the results are quite similar, although 
in the 


are more extremé¢ 


main the differences between the larger groups 


The Learning Tasks 


« 


As part of a larger battery of procedures, each 5 
modified version of 
Ruch (13 As in his 


lists 1 


logically related 


was individually administered a 


‘ 


the learning tasks described by 


experiment, Ss had to learn three separate 


W ord 
word pairs such as Man—Bcy; 2. Nonsense Equation 
eg., E X Z = G; 3. False Equations, e.g.,3 * 5 6 


j 


In the present study the task was changed somewhat 


1 ssociates, consisting of familiar 


from Ruch’s original procedure After preliminary tests 


the length of list was reduced from 10 to 8 items and 


rABLE 1 


Lists Usep IN THE THREE LEARNING TASKS 


False 


Nonsense 
Equations I 


Word Associates 


quations 


Man-Boy EXZ=G «Kx 5S m6 


example example example 
1 Stem-Bud RxXS5S oO 2x & 
2 House- Visit H xX Y J 6X3 5 
3 Room Light BX D M 5x 1 7 
4 Soft-Chair AxXM B Dx I l 
5 rree-F lag sx Q H +x 3=4 
6 Nest-Ow! K XU #1 x 5 = il 
7 White-Pink WX L . 2xX4=9 
8 Walk-Car Lx Be#D $x 4 2 
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the number of trials from 15 to 6 in order to make the 
task easier and the session less fatiguing, particularly 


for the older Ss 


Each list consisted of 8 units Table 1 elected 
from the 10 used originally. Before each list was given 
was first shown a sample pair and the task explained 


After an initial presentation of the series, S was 


which he wa 


member of the pair when the 


6 trials in to anticipate the right 


It was first shown alone 


The items were projected by a 35 mn trip film 


projector in a sen 
for 4 seconds. The 


appeared alone for 


idarkened room, each frame remain 
left-hand m« r of the 


4 seconds, both art lor 


ing in view 
first pai 


4 seconds, then the left-hand member of the second 


| between 


pair, and so on 


trials and 


Eight seconds were allowed 
? minutes between lists. To prevent 


serial learning, the order of items was changed on each 


about 2 


trial 


The instructions used were substantially the ume 


as Ruch’s. The S was encouraged to guess i! uncertain 
In presenting the lists of 
tions the divided at the 


first 3 «* 5 was shown, then 3 & 5 6 


Nonsense and Fa Equa 


items were equal sign, ¢.g 


RESULTS 


In Table 2 are presented the mean scores for 


each of the three tests. Since there were 8 
items and 6 trials, the highest possible score for 
an individual is 48. Young Ss clearly perform 
Che difference between the 
O01 


clear 


better on each task 
groups is significant in each case at the p . 


level on U test. Moreover, it is equally 


that for each group the Word-Associate learn 
than either the 


learning of the Nonsense or False Equations, 


ing was decidedly easier 


although, surprisingly, there is virtually no 
difference between the latter two 

Ruch’s (13) deter 
deficit 


material, we 


Following analysis for 


mining variation in the amounts ol 


among the three types ol can 
compare the difference between old and young 
on each of the tests. For the Word 


this difference in score is 13.81, for the Non 


Associates 


sense Equations 17.57, and for the False 


Equations it is 17.62 (Table 2). These values do 
not differ significantly from each other 
However, the difference between the groups 


in the learning of the three types ol material 


rABLE 2 
MEAN LEARNING ScORES FOR YOUNG AND OLp Groups 





Wor False a] 
» Mean! Asso ense | Equa-| 4 - 
{ if ’ A r Equa ‘ » =z 
ge | ciates n s s 
WA FI s z 
Young 16 26.8 44.19 25.0024.81 43.59 44.21 
Old 16:78.130.38 7.43) 7.19 74.8674.96 
Diff 13.8117.57:17.62 


Iu M 
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TRIA 
ric. 1 EARNING CURVES FOR THE OLD AND YOUNG 
Groups ror Eacn or tHe Turere Test 
rhe curves for the young are heavily drawn, for 
the old lightly drawn. The three tests are distinguished 
Word Associate Nonsense Equa 
tior False Equatior 


can be viewed in another way. If we consider 
the proportional change between the tests it 
does appear that the older group has relatively 
poorer learning on both the Nonsense and 
False Equations when these are compared with 
the Word Associates. Subtracting the Word 
Associate score and the Nonsense Equation 


score and taking this difference as a fraction ol 


the Word-Associate score, the mean ratio 
WA NE ; 
) isa3 19 for the young Ss and for 
WA 
the old 74.86. The difference between these 
values is significant, on L’ test, at the p < .001 
leve Similarly, the ratio of Word-Associate 


score minus False Equation divided by Word 
Associate (= . ~) is 44.21 for the young 
WA : 


old (Table 2); on [ 
OO1 level 
onclude that not only 


group and ‘74.96 for the 


test significant at the p From this 


analysis we may tairly 

do the older Ss learn less in each of the three 
tasks, but that they are proportionately more 
deficient in the learning of materials in which 
the facilitative effects of prior experience are 


minimized. However, we have not found evi 


dence that interference materials are more 


difficult than nonsense materials 


The t 


froup 


ial-by-trial learning curves for the two 


on each of the tests show ihe difference 


in performance already discussed to be visible 


throughout learning (Fig. 1 The curves for 


Nonst ns 


ire virtually 


the ions 


e Equations and False Equat 
identical within each group and 
distinct from Word 


It does seem, however, that 


each Oo hese 1s Associates 


level the older 


group’s curves for the Nonsense and False 
Equation tasks approximate linear functions 
while their values on the Word Assoicates more 
nearly describe the more usual negatively ac- 
celerated learning function. Particularly in the 
early trials there is proportionately less rise in 
score on the meaningless and interference ma- 
terials. Using Edwards’ (5) suggested analysis 
of variance for trend, the curves for both of 
these iasks differ significantly from that for the 
Word Associates (p - 
difference in the forms of the learning curves is 
suggestive of Lewis and Shepard’s (11) finding 
of a negatively accelerated function for per- 


001 in each case). This 


ceptual motor learning consistent with past 
experience and of a linear or positively acceler- 
ated curve in a task which did not utilize old 
habits 

Not only do the old and young groups differ 
in the amount of material learned, but they 
differ as well in the type of error made during 
the learning process. In Table 3 the total 
number of errors made by each of the groups on 
each of the three tasks is broken down into 
various types. In both the Word Associates and 
the Nonsense Equations there are three distinct 
types of error possibie: S may not respond at 
all to the stimulus item (Vo response); he may 
give an associate which appears nowhere in the 
series (Random response); or he may respond 
with an item which although incorrect for the 
present stimulus item is appropriate elsewhere 
in the list For the False 
Equations list S can also give a response which 


(Correct elsewhere). 


represents a correct multiplication, or con- 
ceivably, he add or the two 
numbers in the stimulus equation. Although 


can subtract 


the completion of the multiplication is of par- 
ticular theoretical interest since it may repre- 


rABLE 3 


PERCENTAGES OF VARIOUS Types GF Errors MADE 


IN WorpD ASSOCIATI 
NI AND FALSE 


WA), NONSENSE EQUATION 
TEsTs 


EquaTion (FE 





Group est N yf & 
cond i Age ;, Errors - 
Young (16) 26.8 | WA 61 , ‘1 
NI ‘ 7 48 
FE 30 9 4 8 é 
Nd 16 78.1 WA « 1 1¢ 
NE 19 ) 
Fr , ; 18 1 i 2 


* These types of errors are only possible on the False Equation 


task 
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sent the intrusion of the competing, albeit 
actually correct, response, addition or sub 
traction may also reflect habitual response to 
the implied structure of an arithmetical prob 
lem. Scoring these errors does, of course, in 
volve interpretation since such response might 
as well be a Random response or, in a particular 
case, Correct elsewhere 

There is considerable difference in the pat 
tern of errors between the two groups (Table 
3). On all three tasks the typical behavior of 
the older person 1s not to respond, distine tly 
less often to give a response that is Correct 
elsewhere. Compared to the old group, young 
Ss have proportionally more of their errors of 
the latter type. They are also more prone to 
give Random responses and in the case of the 
False Equations to multiply, add or subtract 
the items, although there are few of these 
errors in either group 

Closer inspection of the trial-by-trial changes 
in the pattern of error gives further evidence 
of the difference between old and young 
learners. On all tasks on each successive trial 
the young group has relatively fewer of its 
errors in the Vo response class and an increasing 
proportion of Correct elsewhere errors. On the 
Nonsense Equations task, for example, of the 
total errors made by the young Ss on the first 
trial 57% are No response and 30% Correct 
elsewhere, which changes gradually to 30% No 
response and 66% (¢ ‘orrect elsewhere by the sixth 
trial. However, the old group preserves roughly 
the same proportions on each trial. Thus, for 
the same task, these Ss start with 95% No 
response and 3% Correct elsewhere and by the 
last trial still have 92% and 8% of their errors 
in each of these classes. Although there is 
somewhat more tendency for the No response 
proportion to go down on the Word Associate 
and False Equation tasks, the pattern is 
basically the same. Thus, it is clear that simply 
not responding to a stimulus item is character 
istic of the learning behavior of the older Ss 
throughout; it is less common in the younger 5 
at any point and declines greatly as learning 
pro eeds 

Lastly, how related are the performances of 
individuals on the three learning tests? The 
correlations among the total scores for the three 
tests within each of the age groups are given 
in Table 4. The results are surprising. For the 
young Ss there is a positive relationship among 


all the procedures, lowest between the Word 
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rABLE 4 
CORRELATIONS AMONG LEARNING TASKS 
Young Old 
Word Associates vs. Nonsense 372 506* 
Equations 
Word Associates vs. False 607* 317 
Equations 
Nonsense Equations vs. False 740°* 240 


Equations 


ignificant at p « 


** Significant at p < .00 


Associates and Nonsense Equations, and quite 
high between the two forms of equations. How 
ever, the older group has its highest correlation 
between the Word Associates and Nonsense 
Equations and its lowest, and in a negative 
direction, between the Nonsense and False 
Equations. Thus, the patterns are opposite in 
the two groups. Moreover, these findings differ 
sharply from those of Ruch who found that all 
of the interest correlations were higher in his 
oldest group and highest among these was the 
correlation between the Nonsense and False 
Equations. Indeed, the relationship between 
these two tests was generally stronger than 
between any other pair in his experiments: 
+.404 in the young group; +.473 in the 
middie-aged; and + .833 in the old. We are ata 
loss to explain the differences in findings both 
between our two groups and between this ex- 


periment and Ruch’s. 


DISCUSSION 


here is no doubt that older persons in this 
as in Ruch’s (13) original study are less well 
able to master each of the learning tasks. As 
such, this finding is in agreement with a con- 
siderable body of materia! whi h shows a 
generally decreased ability to learn with ad 
vancing age (6, 12, 14). However, unlike Ruch, 
we have found virtually no difference between 
scores on the nonsense and interference ma 
terials and no evidence of a greater degree of 
deficit in the latter than in the former. As was 
noted earlier, the total amount learned by 
Ruch’s Ss of all ages was greatest on the Word 
Associate task, second most on the False 
Equations and least on the Nonsense Equa 
tions, while deficit, i.e., difference in perform 
ance of young and old, was least for the familiar 
material, greater for the nonsense material, and 
greatest for the interference material. However, 
it may be that the differences, though seem 








68 
ingly sma between Rucl and the present 
procedure are ificient to account for the 
obtained differences in re Perhap th 
onger t d more tr the early findings 
would have been repeated but in an ise the 
difference mn method do lu direct com 
pari ym ot re iit 

What we have found that, compared to 
the level of attainment on Word-Associate 
earning he older person now proportion 


the new and inter 


peak 


ately poorer learning on both 


‘ 1} 


materia ly mis sense, we can 


deficit both 


ference 


ol greater on of these materials, 


although there is no difference between them 


While it may be that older px rsons are 


when required 


, 
uniquely 
prone to disturbe | performance 
been sug 


the 


“isting habit is ha 
9, 14, 16), 


experiment 


to reorganize ¢ 
gested by a number of paper 


evidence of the present suggests 


that the older individual will perform less well 


in any situation in which he must deal with 
material which is in any way novel. When the 
task at hand can not be integrated with earlier 
experience, either because there is no pre: edent 


or existing precedent is inadequate or contra 
uffer Ihe les 


of the aged to deal with novelty was 


dictory, learning sened capacity 


also found 
in heightened decision times at the beginning 


of a series when S had to judge a number of 


ambiguous stimuli (10 


Older and younger Ss differ also in their be 
havior during learning. The younger, as we 
have noted, are more likely to make some re 
sponse to the timulus item. to offer hy 
potheses whether correct or incorrect, and as 


le arning proce ed to have an in reasiny propor 


tion of their errors responses which, although 
correct elsewhere in the list, are inappropri 
ately given to the particular stimulus. The 


older S will either respond correctly or not at 
all, and this behavior is consistent from trial 
to trial. In a study of the perception of closure 
showed a greater tendency not 


lest 


the older Ss also 


to respond items 
(2) 


We 


mine his 


o Gestalt ¢ omple tion 


believe that three factors jointly deter 


behavior of the older Ss. although 


Irom ¢ valilable evidence we cannot udge their 
J 


relative contributions 


l lhe older person | more cautious Despite 


the injunction to guess, it may be that older 
persons require greater certainty before they 
Although 


the item is equally incorrect whether 


S knows that 


he vives 


are willing to report 
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no response ora wrong respo ise, he apparently 
preters the error of omission to that of com 


mission. This tendency to inhibit response in 


the uncertain situation may reflect a more pro 
found personality defense of the aged through 
which the re« ognition ol inadequas y is avoided 

2. The older person requires more time for 
the integration of a response. A condition of the 
experiment which may account for some of the 
differences may be the rapid pacing of the 
items. As the experiment was conducted S had 
to read the stimulus item, select and report a 
response item within 4 seconds. Considerable 
work has shown that older persons require 
more time both for simple perceptual! discrimi- 
nation and in more complex decision-making 
(3). Moreover, requiring performance within 
arbitrarily short times can result in a significant 
reduction in the adequacy of that performance 
(16) that the 
processes of perceiving-judging-recalling-re 


Such findings would suggest 


sponding require more time for the older person 
than is available and that, perhaps, if the 
pacing were slower his behavior might differ 
less than that of younger individuals 

3. For the older person learning may prin 
the 
stimulus-response combinations, and 


discrele 
unlike 
the younger person, may not include a prior 
which the first 
learned as belonging somewhere in the series, 


cipally involve association of 


stage in response items are 
before they are attached to particular stimuli 
Mhat the pattern of errors is generally the same 
on all three learning tasks and that there are no 
important changes in pattern from trial to 
that the 


trial further suggest acquisition of 


separate and distinct units may be a general 
characteristic of learning in the aged. Implicit 
in this view is the assumption that there are in 
general two patterns ol learning: one in which 
there is first a more general learning from which 
particular associations are increasingly differ 
entiated; and a second in which only the 
stimulus response associations 


specific are 


formed. We might speculate that the second 
pattern, seen here in the behavior of aged 
persons, is generally characteristic of more 
constricted or rigid cognitive organization and 
might be found as well in the behavior of 
anxious or brain-damaged individuals 

From the first of these three views one as- 
sumes that the older learner may have avail 
unwilling to 


but is incorrect 


the 


able, report, 


hypotheses. According to the second, 
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formulation of such hypotheses can not be 
completed and reported within the time avail 
able. But on the last assumption neither 
caution nor time blocks response, rather there 
are fewer incorrect hypotheses available. The 
relative importance ol these factors could be 
explored in further experiments in which, for 
example, Ss might be required to make some 
response to each item and/or the time of 
presentation increased 

All of these factors, however, may be con 
ceived a parts of the total aging process In 
the aged there is a decreased ability to compre 
hend organize, and integrate new experience 
both in perception and in memory. Integration 
is likely to be incomplete oltten rigid or con 
fused, and the integrative process requires 
greater time. Performance is most adequate 
where the individual can draw most heavily 
from past experience, most disturbed when 
novel behavior is involved. As a consequence 


the aged person seeks the security of the 


familiar, risks fewer failures by making fewer 
ittempts, and so defends against the recogni 


tion of inade quacy 
SUMMARY 

An old and a young group of Ss, mean ages 
78.1 and 26.8 years, were given three paired 
associate learning tasks which differed in the 
degree to which prior experience might be 
expected to facilitate or block present learning 
They onsisted of (a) familiar word pairs, 
(b) nonsense equations, and (c) false equa 

Botl groups performed best on the word 
associate task, but there was little difference 


] 


the learning of nonsense and false 


between 


equatio within either group On ali three 
pro edurt the old yroup wa significant! 
poorer, but they were proportionately more 
deficie! n the learning of materials in which 
the facilitative effects of prior experience art 
minimized, i.e., the two torms of equation 
However, they had no greater difficulty with 
the interference than with the nonsense ma 
terial. The older person thus seems to be les 
( ipable n dea ing with novel materia which 
can not be readily intergrated with earlier 
experience 

Older and vounger Ss were also tound to 


differ in their behavior during learning 
Younger Ss are more likely tooffer hypotheses, 
whether correct or incorrect, and through 
learning to have an increasing proportion of 
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their errors responses which though correct 
elsewhere in the series are incorrect for the 
particular stimulus. By contrast, the old Ss 
respond corres tly or not at all, and the propor 
tion of “no response” errors remains high 
through learning. Three factors are discussed 
as joint determiners of this behavior: olde 
persons are mort cautious; they require more 
time to integrate response; and their learning 
may consist mainly of discrete stimulus-re- 
sponse associations, not preceded by more 
general learning. These factors are conceived 


as related parts of the total aging process 
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PERSONALITY, PROBLEM SOLVING, AND THE 
EINSTELLUNG EFFECT! 


BRENDAN A. MAHER 


Northwestern University 


poRNO (1) and others have reported a 


pattern of personality characterized 
certain and behavioral 


by cognitive 


tendencies among which is included rigidity of 


set and outlook (1, p. 561 Lhe concept ol 
rigidity has been employed in many ways, a 
comprehensive summary of which is to be 
found ina paper by Fisher (6), but a review of 
its uses is unnecessary here. This study is 


concerned with rigidity as defined by Adorno 


and his colleague pre conceived cateygoriz 


in relation 


ing, inaccessible to new experience”’ 
to certain other aspects of personality, in 
particular pseudo-scientific or antiscientific 


tendencies in interpreting individual behavior 
The “e 


the same 


hnocentric”’ individual is described 


as exhibiting 


by 


person ility organization that includes 


writers a cognitive 
“implicit 
denial of personality dynamics, ready explana 
tion by accidental factors, heredity, etc., super 
tition.”’ This method of organizing perceptions 
of the self and others 1 


and is contrasted with the tendency to explain 


defined as antiscientific 


human behavior in dynamic sociopsychological 
term 
Rokeach 


jar problem 


(12) has used the familiar water 


to demonstrate that rigidity of set 
n 


in problem solving is found more frequently i 


high” ethnocentric groups than in low”’ 
groups Hi finding have been challenged by 
Luchins (11) and Brown (2); the later conclud 
ing that this relationship existed only ~ in 
ethnocentric Ss when they felt an anxious 


concern over achievement 
Against this background the present study 


designed to test two simple hypothe es 


wa 
a) Ss who reveal antiscientific or nondynami 
tendencie in their perceptions of human 
behavior are more subject to the Einstellung 


effect in cognitive problem solving than Ss who 


use dynamic psychological concepts in inter 


preting human behavior. (6) This relationship 

' The data upon which this paper is based are taken 
from a dissertation submitted by the writer to the 
Graduate School of the Ohio State University in partial 


fulfillment of the requirements for the degree of Doctor 
ol Philosophy 


Particular acknowledgments are due 


George Kelly for his advice and encouragement 
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is more evident when the Ss are placed in a 


situation likely to rise to achievement 


anxiety. 
The use of ‘‘antiscientific’”’ bases for evaluat- 


give 


ing individual behavior has been described in 
an unpublished study by Sullivan (13). He 
used a method in which Ss were required to 
sort and group photographs of people on any 
basis that they might wish to use. The sorting 
categories reported by the Ss were then clas- 
sified as objective or nondynami 
(e.g. hair-color, 
general, etc.), or as inferential (dynamic 


concepts 


physical characteristics in 
con- 
cepts, the latter implying some attempt to use 
inferred psychological or behavioral char- 
acteristics. His findings were the reverse of 
those to be predicted cn the basis of the pre- 
vious studies It sug- 
gested post facto that in this experiment the 
would 


in ethnocentrism was 


“scientific” or objective approach 
properly utilize nondynamic categories, as the 
judgment of personality from photographs is 
essentially an unscientific procedure based on 
a willingness to infer personality attributes on 
the basis of pictured, external characteristics. 
lo avoid this ambiguity of interpretation, 
the present study used a measure derived from 
Kelly’s Role Construct 
(RCRT) (9). This procedure presents S with 


the task of evaluating particular individuals 


Repertory Test 


with whom he is personally acquainted, and 
may therefore be assumed to elicit the per- 
ceptual categories that he uses in everyday 
life. In the test, S listsa number of persons of 
his acquaintance, on the basis of definitions 
(Kelly’s “‘role-titles’’) provided for him. He is 
then required to consider particular trios of 
these individuals in turn. Each trio is divided 
by S into a single and a pair of individuals on 
the basis of some perceived similarity between 
the two members of the pair which is not shared 
by the third (single) member. For each trio, S$ 
reports the basis upon which the division was 
The list 
but the persons actually listed will, of 


made of definitions is the same for 
each 5S, 
course, be different. For any particular purpose 


the number and nature of the definitions may 
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be altered, and the number of trios may be of 
any size within the limits of combinations per- 
mitted by the number of definitions. 

One estimate of the reliability of the RCRT 
has been derived by Hunt (8). He administered 
the test to a sample of college students followed 
one week later by a similar administration but 
using different role titles. A mean percentage 
agreement of 70.1 was obtained. As responses 
were only regarded as in agreement when they 
were identical, it is apparent that any broader 
definition of similarity could not reduce Hunt’s 
percentage and would be likely to increase it. 


METHOD 
Subjecis and Procedure 


Undergraduates in an elementary mental hygiene 
class were divided into two equal groups 

ichievement anxiety group. These were students in 
one section of the course. At the beginning of the course 
they completed the RCRT. Five weeks later a mid 
quarter examination was held in which all Ss partici 
pated as a course requirement. At the next class meet 
ing the group was informed that the examination 
results had been very unsatisfactory and that as an aid 
to grading them two simple tests of “mental alertness” 
would be administered immediately. The problem 
solving tests were then administered in group fashion 
At the end of the class hour their graded examination 
papers were re turned to the group and the experiment 
explained. This procedure was intended to demonstrate 
that the problem-solving tests had not actually affected 
their examination grade and to eliminate any residual 
anxieties. It was £’s impression that during the test 
session the motivation of the group was sufficient to 
justify the inference that achievement anxiety was a 
relevant variable 

Control group. This grou», which comprised another 
section of the same course, completed the RCRT as 
above. The problem-solving tasks were administered 
in group fashion during a regular class meeting. They 
were presented With standard instructions as part of 
the demonstrations used regularly in the course 

This procedure was repeated over two academik 
quarters because of the attrition rate associated with 
errors in the completion of the printed problems. Com 
plete data were obtained from 62 Ss, 31 in each group 


Instruments 


Verbal maze problems. This series of problems was 
identical in content and sequence with that reported 
by Cowen? (5). Each maze was ditto-printed on a sepa 
rate page. The pages were stapled together in such a 
fashion that it was necessary to solve each maze in 
turn and then detach it before being able to inspect the 
next problem in the series. Following Cowen’s method, 


the problems were presented in sequence commencing 


2 The complete set of mazes was supplied to the 


writer by Cowen in a personal communication 
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TABLE 1 
EINSTELLUNG PROBLEMS: WATER-JARS SERIES 


Quan 
Number Available” be Ob. Te of Problem 
tained 
39 4 31-2 jar practice 
B x0 40 3 4 43 jar practice 
1 27 74 20 7 Control 
2 42 71 5 19 = Ist Einstellung 
3 50 81 7 17 2nd Einstellung 
4 23 st 9 16 3rd Einstellung 
5 27 71 14 16 4th Einstellung 
6 35 77 17 8 Sth Einstellung 
7 we. 12 ist Crucial 
8 32 79 #15 17 2nd Crucial 
9 37 90 «16 21 jrd Crucial 
10 . fo 18 4th Crucial 
11 19 65 7 26 Extinction 
12 28 65 9 19 Sth Crucial 


with two practice problems; one control problem (which 
could be solved by a long or a short method of solution) 
five Einstellung problems soluble only by the long 


method; four crucial problems which, like the control 
problem, were amenable to either method of solution; 
one extinction problem, not soluble by the long method; 
and finally one more crucial problem. The Einstellung 
score is the number of crucial problems solved by the 
long method, and could attain a maximum value of 5 

Water-jar problems. These problems were presented 
in the same control-crucial sequence as the verbal 
mazes. The problems used are given in Table 1 below 
Administration of the problems was in booklet form to 
the groups of Ss in a manner identical with that used 
in the verbal mazes. To take into account the pos 
sibility that perception of the correct solution in one 
series of problems may transfer to the solving of the 
second series, one half of each group commen ed with 
the verbal mazes and then went on to the water-jars 
problems. The sequence was reversed for the other half 
of each group. Thus while transfer effect was not elim 
inated, it is presumed to have influenced the group 
mean scores equally for both problems 

Role Construct. Repertory Test. The RCRT was ad 
ministered with 11 definitions and 22 triads. The defini 
tions used were as follows 

1. “Myself” 

2. Father or vurrogate 


3. Mother 

4. Brother 

5. Sister 

6. Opposite sex peer most understood 


7. Opposite sex peer least understood 
8. College professor most understood 
9. College professor least understood 
10. Same sex peer most understood 
11. Same sex peer least understood 
The triads were selected randomly with the restriction 
that each defined figure was to be categorized six times 
From each S, a total of 22 category responses were 
obtained, with the possibility that any category might 
appear more than once. The categories used by each 
S were typed on separate cards and numbered to 


identify the protocol from which they were taken. As a 
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result of the considerable number of duplications found 
the total of 


maximum of 


categories 
1,364. The 


groups 


within and between protocols 


was well below the possible 
cards were then inspected and sorted into two 


by each of 3 judges in turn, on the basis of the following 
criteria 
Nond ynamu 


categories. A group including all those 


categories based upon: (a4) Physical appearance, health, 


height, weight, et« b) Kindred relationship, degree 
of acquaintance, etc.; (c) Legal status, sex, age, o 
cupation; (d) Geographical, cultural, or social group 
membership; (¢) Presence or absence of a specific skill 


or achievement 


Dynomic categories. A group including all categories 


referring to personality in terms of needs, motives, 


Examples 


drives, characteristic behavior patterns, et« 
would be aggressive intelligent,’ ‘ambitious,”’ 
“anxious and the like 

he judging was on a forced-choice basis, as each 


category card was to be placed in one of the two groups 
Where the 
tions, the majority decision was accepted 
the 


judges’ evalua 
Each judge 
judgments 


lifferences arose between 


operated independently collating of 


being done by the experimenter. Interjudge agreement 


was computed in terms of percentage ol agreement on 


the total cards judged. The lowest agreement between 
any two judges recorded was 91.1; the highest was 
99.0. Under the circumstances it seems reasonable to 
conclude that the criteria are reliable as bases for 


judgment 

The judging completed, each S was then scored with 
respect the he 
categories between 


to number of times which used non 


dynam to discriminate hgures 
possible source was 22. Scores obtained 
ranged from 0 to 19. A range of 0 to 3 included the 31 


while the high scores ranged from 5-19 


rhe maximur 


lowest scores 
The mean scores of these two groups were 1.8 and 9.3 


respectively 


RESULTS 
Che mean Einstellung scores of both groups 
on the two problem-series are presented in 


Table 2. Analysis of 
statistically significant difference in respect to 


these data revealed a 


rABLE 2 
MEAN EINSTELLUNG Scort IN RELATION TO 
ACHIEVEMENT ANXIETY AND TO USE OF 
yoyNamic (ND) CATEGORIE 
\ Einstellung Test Means 
(,roul 
; Water jars* Mazes** 
Achievement anxiety 
High ND 17 2.13 2.20 
Low ND 14 2.25 2.07 
Control 
High ND 14 1.55 1.80 
Low ND 17 1.44 1.63 
* For achievement anxiety vs.‘control, ¢ 1.29, 2 
for High ND, ¢ 1.9 (ne 
** For achiewement anxiect vs mtr i = 1.97 for 


High ND vs. Low 
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the achievement-anxiety variable on the water- 
jars problems. There were no other significant 
differences between means. In no instance did 
an SS use the long method for solution of the 
final (postextinction) crucial problem. 

In view of the fact that the High Nondy- 
namic (ND) grouping includes a wide range of 
scores, it was deemed advisable to ensure that 
the differences in problem scores were not 
being contaminated by initial differences in 
ND scores between the Control High ND and 
the Achievement-Anxiety High ND subgroups. 
Iherefore the 20 highest and 20 lowest ND 
scores and their distribution 
between the four groups was tabulated. A 


were selected 
chi square derived from this distribution was 
nonsignificant. It is therefore assumed that the 
differences found between groups can be at- 
tributed to the obvious variables and not to 
chance maldistribution of the High ND Ss 
between the control and experimental con- 


ditions. 


DISCUSSION 


The data obtained in this study do not 
support the hypothesis that individuals who 
show nondynamic tendencies in their percep- 
tions of other people are more susceptible to 
the Einstellung effect in cognitive problem- 
solving situations. Some support is offered for 
the conclusion that the presence of achieve- 
ment anxiety creates this susceptibility in Ss 
irrespective of their tendencies to dynamic or 
nondynamic perception in interpersonal rela- 
No evidence of any interaction effects 
the obtained difference was 


tions 
was found and 
significant only for the water-jars problems. 
These findings appear to be relevant to the 
broader issue of generality in rigidity. The 
issue is, essentially, whether rigidity is to be 
regarded as a dimension of a given person’s 
behavior that underlies many diverse ac- 
tions, from social relationships to mathematical 
calculations, or as relatively sper inc toa given 
situation. The latter alternative implies that 
rigid behavior might be elicited from anyone in 
that situation. It appears that the data re- 
ported above support the latter rather than the 
former In this run 
counter to the findings of Brown in a similar 
reports a 


OSItION respect they 
| I ; 


experimental situation (2 srown 
significant correlation between scores on the F 
and Einstellung scores when Ss were 


] 
scaie 








it 
d 
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working under conditions designed to arouse 
achievement anxiety, but not under “‘relaxed’”’ 
conditions, with the implication that rigidity 
functions as a “‘preferred’”’ defense for some 
persons, being elicited by any stimulus per 
ceived as threatening to status or prestige. In 
the absence of such threat, the rigidity would 
be either minimal or absent 

The fact that this relationship was not 
found in the present study may be due to a 
number of factors. First, Brown did not use a 
control problem in his water-jars series as he 
found that its omission maximized the 
Einstellung effect. The fact that a control 
problem was used in the present study may 
contribute to an explanation of the different 
results Whether or not to use a control 
problem depends upon the necessity of observ 
ing the susceptibility of Ss to set induced 
within the confines of the laboratory situation 
A second factor is related to the arousal of 
achievement anxiety. In Brown’s study, “‘the 
experimenter (£) maintained an extremely 
grave, aloof manner, dressed formally and 
conservatively and repeatedly cautioned Ss 
against looking at their test materials ahead of 
time,’”’ and suggested that “‘the tests to be 
taken were measures of intelligence and moti- 


vation and the results were of great impor 


tance.”’ The present experiment, in contrast, 
I | 

related performance on the tests to a rather 

specific punishment-reward situation —namely 


a grade on a course in which Ss were enrolled, 
and may have created a higher level of achieve 
ment anxiety. Perhaps when the anxiety is 
ifficiently intense, even the most flexible of 
indi iduals may show rigid behavior When 
} 


this happens, differences between individuals 


which may be evident in less threatening 


ituations become obscured. Brown comments 
that vhen the authoritarian 1s presented with 
the arithmetic problems in a very relaxed 


social situation he solves them as easily as the 


equalitarian.”” To this might be added the 
comment that when the equalitarian is pre 
sented with the arithmetic problems in a situ 
ation that |! ufficiently anxiety arousing, he 
Solve them no more easily than the author 
itarian. The relationship between stress, anx1 
ety, and rigidity has already been reported in 
the literature (3, 4, 7, 10 

The problem of generality in rigidity may 
be considered in terms of the dimensions along 


which rigidity is assumed to generalize If 


situations involving two or more pencil-and- 
paper cognitive problems show significant 
correlations within a given group of Ss then, 
it is possible to infer a factor of rigidity oper 
ating in Ss’ responses to this type of problem 
It may be possible to predict that rigidity will 
be elicited in a given S’s response to other simi 
lar paper-and-pencil cognitive problems. Such 
evidence seerns to be an inadequate basis, 
however, for predicting behavior in social situ 
ations. This position resembles that of Cowen, 
Wiener, and Hess (5) who obtained significant 
correlations between the water-jars problems 
and alphabet-mazes, but point out that such 
correlations cannot be regarded as evidence 
that there is a “relationship between rigidity 
and personality ” ex ept in the precise exper 
mental situation in which the correlations were 
obtained. With this point of view the present 
writer finds himself in close agreement 


SUMMARY 


Two groups of Ss, 31 in each group, com- 
pleted two Einstellung-test problem series. 
One group was tested under standard condi 
tions, the other under conditions calculated to 
arouse achievement anxiety. The Ss were then 
scored on a variable of dynamic or nondynami« 
perceptions of other people as defined by 
Adorno and his co-workers (1). The measure 
was derived from a modification of Kelly’s Role 
Construct Repertory Test. This variable was 
found to be unrelated to the Einstellung-test 
scores, whereas a significant relationship was 
found to achievement anxiety. The results are 
interpre ted as generally favorable to the view 
that situational or field variables play a sig 


nificant vole in the produc tion ol rigid behavior 


REFERENCES 


1. Aporno, T W FRENKEL BRUuNSWIk ELss 
Levinson, D. J., & Sanrorp, R. N. The au 
thorilarian personality, New York: Harper, 
1950 

?) Brown, R. W. A determinant of the relationship 
between rigidity and authoritarianism. J. abnorm 


c. Psychol. 1953, 48, 469-476 


3. Curistie, J. R. The effects of frustration on rigidity 
in problem solution. Unpublished doctor's dis 
ertation, Univer. of California, 1949 

4. Cowen, E. I The influence of varying degrees of 

chological stress on probiem-soiving rigidity 


J. abnorm. soc. Psychol., 1952, 47, 512-519 
Cowen, E. L., Wrener, M. & Hes Jupirn 
Generalization of problem-solving rigidity. J 


onsult. Psychol., 1953, 17, 100-176 








10 


BRENDAN A. MAHER 


Fisner, S. Patterns of personality rigidity and 
some of their determinants. Psychol. Monogr., 


19% 
Harri 
men 
A.P 
Honr, 
com 


thes 


KeLLy, 


), 64, No. 1 (Whole No. 307) 

Ss, R. A. The effects of stress on rigidity of 
tal sets in problem solution. Paper read at 
A.. Washington, D. C., Sept., 1952 

LD). E. Studies in the role concept repertory 
eptual consistency. Unpublished master’s 
is, Ohio State Univer., 1951 

G. A. The psychology of personal constructs 


New York: Norton, 1955 
Kenvic, Isapette, V. Studies in perseveration 


Il 


Determining factors in the development of 


compulsive activity. J. Psychol., 1937, 3, 231 
246. 

11. Locuins, A. On recent usage of the Einstellung- 
effect as a measure of rigidity. J. consult. Psychol, 
1951, 15, 89-94 

12. Roxeacn, M. Generalized mental rigidity as a 
factor in ethnocentrism. J. abnorm. soc. Psychol, 
1948, 43, 259-278 

13. Suttivan, P. Conceptual properties governing the 
categorization of people. Unpublished doctor’s 
dissertation, Univer. of California, 1950 


Received March 5, 1956. 





he 
r’s 


ON THE RELATION BETWEEN THE PROBABILITY OF A WORD AS 
AN ASSOCIATION AND IN GENERAL LINGUISTIC USAGE 


DAVIS HOWES 


Center for Advanced Study in the Behavioral Science 


HE word-association experiment, de 

veloped primarily as a clinical tool, is 

now being used extensively as a method 
for the quantitative analysis of linguistic be- 
havior (1, 3, 5, 6). Its value in such investiga- 
tions depends to a considerable degree upon the 
extent to which results of the experiment can 
be generalized to language emitted under other, 
more natural conditions. The criticism is often 
advanced, for example, that the specialized 
nature of the instructional set and other con- 
ditions attendant upon the experimental situa- 
tion invalidates the association experiment as 
a technique for studying general linguistic 
usage. 

The question can be resolved empirically 
only by comparing specific properties of lan- 
guage produced in the association experiment 
with corresponding properties of language 
produced under conditions free of the con- 
straints of the experimental situation. One 
property that has been studied is the distribu 
tion of word-association frequencies, which 
Skinner (7) has shown to approximate Zipf’s 
law as closely as do the distributions of word 
frequencies taken from samples of continuous 
discourse (11). The comparison is here ex 
tended from the form of the distributions to 
the actual frequencies with which individual 
words occur in both types of sample 

To perform this comparison we need a list 
ing of all different words with their frequen 
cies of occurrence in general usage in one col 
umn and their over-all frequencies of occur 
rence as responses in the word-association 
experiment in a second parallel column. If the 
correlation between the frequencies in the two 
columns is perfect, the language of the word 
association experiment may be said to be 
stochastically equivalent to the language of 
general usage. In that event we could con- 
clude that the special conditions introduced by 
the association experiment do not affect a 
person’s selection of words. The effect of a 
stimulus word in the association experiment, 

‘On leave from Operational Applications Labora 


tory, AF Cambridge Research Center. Present addre 
Department of Economics and Social Science, M.I.1 


on this hypothesis, is merely to redistribute the 
factors underlying the emission of words with- 
out changing them. If, on the other hand, the 
frequencies in the two parallel columns are so 
disparate that it is impossible to predict one 
set of frequencies from the other, we would be 
forced to conclude that the experimental situa 
tion introduces fundamental changes into the 
processes underlying a person’s selection of 
words. An empirical determination of the cor 
relation between the two columns will permit 
us to decide which of these two conceptions 


of the association experiment applies 


DEFINITIONS 


An explicit statement of the problem requires 
more careful definition of concepts than has 
been traditional in this field. In this section 
the concept of stochastic equivalence of lan 
guage samples is developed from definitions 
based upon certain idealized linguistic experi- 
ments 

We restrict ourselves to the written language 
produced by some specified population of per 
sons O. It is assumed that the written language 
emitted by O can be dee omposed into sequences 
of occurrences of a set of discrete spatial pat 
terns or letters. Any permutation of letters 
bounded by spaces is a word, and the set of all 
possible words for’a given set of letters is the 
vocabulary. Thus for English the vocabulary 
is the denumerable set of words A, B, . 


Z, AA, AB, , ZZ, AAA, AAB, , bb, 


A word is said to be emitted in the free 
situation if at the time of its emission the be 
havior of O is in no way affected by the ex 
perimenter. An occurrence of the jth word of 
the vocabulary is written w, if it is emitted in 
the free situation 

Consider a fixed time interval 7, called the 
base interval, during which a random sample 
from the population O emits words in the free 
situation. The probability of obtaining an 
occurrence of the ith word in a random draw 
from the sample of words emitted in 7 is the 


base probability of that word, P(w,). The base 
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probabilities of all words in the vocabulary 
define a vector P, 


P(w;), 


2 ,P(w;) a 


Pp [P(w, P(ws 


(1) 


which is called the base probability vector. This 
vector is fixed uniquely by a population O and 
a base interval 7 

Similar definitions apply to the written lan- 
guage produced in a word-association experi 
ment. Let another random sample from the 
same population O be given a standard associa 
tion experiment (4, p. 38) during randomly- 
selected segments of the interval 7. The sym 
bol r,; will be used to represent an occurrence 
of the jth word as a response (association) in 
such an experiment in order to distinguish it 
from a word emitted in the free situation. A 
stimulus presentation of the ith word by the 
experimenter is denoted by s, Che probability 
of obtaining an occurrence of the jth word 
in a random draw from the sample of associa- 
tions given in response to a stimulus presenta- 
tion of the ith word is the associative probability 
of the jth following the ith word, Pai(73) The 
associative probabilities of all words in the 
vocabulary following presentation of the ith 
a vector pi, 
eclor for the ith word, 


word define called the associative 


probabili ly 


Ta | 
Pp, Ps Vij, Pat: ’ Poil%;), }y 
(y.) 1?) 
Pu\t 0, 4] 
LT ipl? 1 


Che ith term of this vector, p,,(7,), is always 


zero, since the standard experimental! procedure 
does not permit the subject to give the stimulus 
word a his 

To establish the « 
Pp and 


we consider the vector of conditional probabili- 


response 
the 
vector P 


mnection between 


ative vector the base 


ties of words emitted in the free Situation, 


P, = [P.,(w:), Poi(ws), «++, Po(w;), ++], 
[3] 
> jPo(w;) 1 
Since 
P(w,; = Plu Po (w,;), [4] 
the base vector P represents the weighted aver 
age ol the vectors P, 
P = P, = Y,P(w)P, (5] 


In order to obtain a comparable quantity 
for words emitted in the association experi- 
ment we define a base associative probability 
vector as the identically weighted average of the 
vectors pi, 


p= Pp = L.P(w,)pi. [6] 


One can think of the base ‘associative prob- 
abilities as the frequencies of occurrence of 
words as responses in an association experi- 
ment in which the stimulus words represent a 
random sample of the words the experimental 
subjects would have emitted during the same 
interval of time had they been left in the free 
situation. In practice, no word is presented as a 
stimulus more than once in a given association 
experiment. Each probability 
vector p; therefore has to be estimated from 
responses single experimental 
presentation of the ith word. It is therefore 
convenient to compute the base associative 
vector from the product of the matrix of asso- 
ciative probabilities [p,, po, ---, pi, «* +] post- 
multiplied by the column vector P 


associative 


made to a 


P= Pi = (P,P -*:, Py, JP. [7] 

The concepts introduced at the outset of 
the 
definition. Stochastic equivalence between the 


paper can now be given more precise 
language emitted in the association experi- 
ment and the language emitted in the free 
situation is defined by equality of the vectors 
P and p. When these vectors are defined as in 
Equations 5 and 6 the equivalence is said to 
the 
equivalence thus implies that, for every pair 


be of first order. First-order stochastic 


(i, 7) of words in the vocabulary, 


ri P(w;) Po (wy;) 


2,P(wi)pu(r;). {8} 


Stochastic equivalence of any higher order can 
be defined by substituting conditional and 
associative probabilities of a higher order for 
those of the first order in Equations 2 and 3. 
Stochastic equivalence of the second order, for 
example, would imply in analogy to Equation 
8 that 


Li P(wi;j) Poi (wr LisP(wis)Pris(te) [9] 


for all triples (i, 7, &) of words in the vocabu- 
lary. In the present study we shall be con- 
cerned exclusively with first-order relation- 
ships. It is important, however, to recognize the 
possibility that the results obtained for the 
first order may not apply to higher-orders. 








] 
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The equation P = p is trivial if the associa- 
tive probabilities of all word pairs are equal to 
their corresponding conditional probabilities, 
i.e., if for all pairs (i, 7) of words, 


Pei\75) - Py, (wy), 
Pen( Tr) = (), 


Po, (wr) = 0, Lj Pw;(w;) - 


Li pal; - S, [10] 


Now, there are no data from which the condi 
tional probabilities P,,(w,) can be estimated 
accurately. But inspection of the data from 
any word-association experiment is alone suf- 
ficient to show thatin fact associative probabili- 
ties differ greatly from conditional probabilities 
for at least some word pairs. To cite one exam 
ple, the associative probability of the word 
light following the stimulus word dark is found 
experimentally to be about 0.4, while the 
probability that light follows dark in the free 
situation (i.e., the probability of the sequence 
dark light over the probability of dark) is 
clearly of a lower order of magnitude. Thus the 
real question at issue is whether the average 
associative probabilities are equal to the aver- 
age conditional probabilities even though the 
individual associative and conditional probabili- 
ties differ greatly. 

The method used here to test for stochastic 
equivalence is to calculate the coefficient of 
correlation between empirical estimates of 
P(w,;) and 2,P(w,)p,,(r;) for as large a sample 
of words as possible. A correlation coefficient 
of one implies perfect stochastic equivalence; 
a correlation of zero, perfect stochastic in- 
dependence. The square of the correlation 
coefficient measures the proportion of the 
variance of the empirical data that is due to 
the relationship between the two sets of fre- 
quencies. It is thus a measure of the extent to 
which the hypothesis of stochastic equivalence 
accounts for those data. The square of the 
correlation also indicates the precision with 
which the probabilities of words in the free 
situation can be estimated from the laboratory 
situation of the association experiment. In 
this sense, it may be interpreted as an index 
of the potential value of the association experi 
ment as a general laboratory method for the 
statistical analysis of language 


MEASUREMENTS 


Measurements of base probabilities and 


associative probabilities for the same popula 


tion of subjects have not been published. For 
base probabilities the most extensive set of 
data published is the Lorge magazine count 
(8). This is an analysis of 4.6 million words of 
text from five popular American magazines 
issued during the period 1927-1938. For asso 
ciative probabilities the standard tables are 
still those published by Kent and Rosanoff in 
1910 (4), which list the associations spoken to 
each of 100 stimulus words by a population of 
1,000 subjects, including 500 hospital workers, 
200 professional persons and 150 high school 
students. Since the two sets of data differ with 
respect to population of subjects, time when 
measured, and mode of response (written or 
spoken), an estimate of the correlation between 
base probability and base associative prob- 
ability based on these tables cannot be ex 
pected to approach unity even in the case of 
stochastic equivalence. 

The Kent-Rosanoff tables list each com- 
bination of letters as a separate word, but the 
published version of the magazine count com 
bines the entries for certain inflected forms 
(e.g., plurals ending in s, verb forms ending in 
ed or ing) with the root forms. The correspond 
ing entries in the Kent-Rosanoff tables have 
been combined in the present analysis to make 
the two sets of frequencies comparable 

The stimulus words used in the Kent 
Rosanoff study constitute a very unrepresenta- 
tive sample of the set of all possible words. All 
are nouns and adjectives, for example, and all 
are words of medium frequency (their median 
log frequency is 2.6 with a standard deviation 
of 0.5 log unit). Weighting each associative 
frequency by the magazine-count frequency 
of the stimulus word appears to be of limited 
value in such a case. On the other hand, the 
weighting procedure greatly increases compu 
tational labor. Computation of base associa- 
tive probabilities has therefore been simplified 


by using the approximation 
LP (wi) pas 3) ~ 0.012 4p, ,(7;) [11] 


in which the 100 stimulus words receive equal 
weights 

rhe notation may be simplified somewhat 
for the purpose of describing the data by drop 
ping the indicators w, s, and r which distin- 
guish the various functions of words in the 
experiments. The base probability is then 


pi 


denoted by (7) and the base associative prob- 


ability by 2,P(i)p(j). The empirical estimates 
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of these theoretical quantities will be referred 
to as the magazine-count frequency F(j) and 
the summed Keni-Rosanoff frequency Xf,(j), 


respectively. 
ANALYSIS OF DATA 


Figure 1 is a plot of the relation between 
magazine-count frequency and summed Kent- 
Rosanoff frequency on logarithmic coordinates. 
The numbers bordering the scatter plot repre- 
sent the numbers of words having frequencies 
of zero, whose logarithms are therefore nega- 
tive infinite. A significant positive correla- 
tion is evident from the figure, but the rela- 
tionship is by no means rectilinear over the 
entire range. The form of the regression lines, 
as well as the magnitude of the correlation, is 
therefore of interest. Both types of analysis, in 
turn, require that the form of the conditional 
distribution functions be known 

Variation of base associative probability with 
base probability constant first the 
distributions in vertical di- 


Consider 
the 
These distributions show 


conditional 
mension of Fig. 1 
given 
magazine-count frequency occur 0, 1, 2, 

times as the Kent-Rosanoff 


Figure 2-A presents several such dis 


what proportion of words having a 
responses in 
tables 
tributions on logarithmic-normal coordinates. 
A straight line on these coordinates indicates 
that the logarithms of the variable plotted on 
the abscissa are normally distributed with a 
mean indicated by the intercept of the line 
deviation inversely propor- 
tional rhe sets of points in Fig 
2-A are well described by a set of lines that are 
both straight and parallel, indicating that the 
conditional distributions of log base associa- 


and a standard 


to its slope 


tive probability for given base probability are 
both normal (Actually, 
there is a slight increase in the standard devia- 


and homoscedastic 
tion of log Zf,(7) at the highest magazine-count 
frequencies, but this is due to the use of wider 
class-intervals in that range in order to obtain 
distributions of more than 100 cases.) 

rhe summed Kent-Rosanoff 
frequency on frequency is 
shown by the solid circles in Fig. 3-A. The 
coordinates Medians have 
been substituted for means in the regression 
analysis because the distributions include zero 
frequencies the which are 
rhe regression line has a maximum at 


regression of 
magazine-count 


are logarithmic 


logarithms of 


infinite 


a magazine-count frequency of about 800 and 
falls off sharply on both sides of that point. 
It may be observed that the regression line 
in Fig. 3-A covers only about half the total 
range of log F(j). Words that are rare in the 
magazine count tend not to occur at all in the 
Kent-Rosanoff tables, which comprise a much 
smaller sample. When the magazine-count 
frequency is small (< 150), the median summed 
Kent-Rosanoff frequency is therefore zero 
and the median log 2f,(j) is negative infinite. 
At the very highest magazine-count frequencies 
more than half the words again fail to occur in 
the Kent-Rosanoff tables, so that the median 
log =f:(j) is negative infinite in that range too. 
An estimate of the regression line over the 
full range can be found by using a theorem 
which states that the line connecting a given 
fractile of each of a set of normal, homoscedas- 
tic distributions is parallel to a line connecting 
any other fractile of the same set of distribu- 
tions.” Since the highest fractiles of the condi- 
tional distributions of log Zf,(7) are finite for 
all magazine-count frequencies, the slope of 
the regression line can be estimated from the 
slope of a line connecting one of those fractiles. 
In Fig. 3-A the 95th and 75th percentiles of 
the log Zfi(j) distributions are indicated by 
crosses and triangles, respectively. These data 
indicate a regression line that is rectilinear and 
positive for magazine-count frequencies of less 
than 800 (log F(j) < 2.9). (The leveling-off 
of the 95th percentiles for log F(j) < 1 is due 
to an artifact explained in the following para- 
graph.) Above that point the regression line 
undergoes a sudden reversal until at the highest 
frequencies its slope ap- 
The significance of this re- 


magazine-count 
proaches* — ©, 
versal will be taken up in a later section. 

The leveling-off of the 95th percentiles for 
log F(j) < 1 is due to the omission of a number 
of words in that range. The magazine count is 


7A proof may be indicated for a set of normal 
distribution functions G(x), Ge(x), , G(x) having 
standard deviations o, = o:; = ¢, = o. Let a and b 
be two proportions, 0 < a < 6 < 1, for which the frac 
tiles of the various distribution functions are x;(a), 
r2(a), , x(a) and 2,(b), , 2n(b), respec 
tively. If h(a) and A(b) are the corresponding fractiles 
of the standardized normal! distribution function, we 


r,(b), 


can write 2;(b) x(a) = [h(b) h(a)|a = x(b) — 
ty(a) = x,(b) x,(a). A line connecting the points 
r,(a), x(a), , x,(a) will therefore lie [4(b) — A(a)\o 


units of x below a line connecting the points x,(d), 
r2(b), x,(b). Hence the two lines will be parallel 


’ 
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Ordinate is proportional to the normal] deviate, abscissa is logarithmic. The distributions have been separated 





by displacing each one to the right one log unit 


published as one of four word counts in the 
Thorndike-Lorge tables (8). These tables do 
not list the frequency of a word in each count 
if the combined frequency of the word in all 
four than 18. Moreover, the 
combined frequency itself is given only if it is 
than 3. In the data of Fig. 1 only 
magazine-count frequencies specifically listed 
used. Words with combined 


counts is less 


greater 


in the tables are 


frequencies of less than 18 are thus assigned a 
magazine-count frequency of zero in Fig. 1 
even though they may actually have occurred 
up to 17 times in the magazine count. 

The number of these words actually having 
a given magazine-count frequency can be es- 
timated approximately by assuming that the 
occurrences making up the combined fre- 
quency of a word are distributed randomly 
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Fic. 3. A. REGRESSION OF SuMMED KENT-ROSANOFF FREQUENCY ON MAGAzINE-COUNT FREQUENCY 
B. REGRESSION OF MAGAZINE-CoUNT FREQUENCY ON SUMMED KENT-ROSANOFF FREQUENCY 


Logarithmic coordinates. Solid circles, triangles, and crosses represent 50th, 75th, and 95th percentiles, re 
spectively. Open circles indicate corrected data as described in text 


among the four equal component counts.’ The 
summed Kent-Rosanoff frequency can be 
found for each word with a known combined 
frequency and assigned on a proportionate 

+ Let Q be the combined frequency of a word in all 


four counts and q;, 92, gs, and gq, the unknown com 


ponent frequencies, any one ol which may be chosen 


to represent the magazine-count frequency. Then 
Lids Q by definition and the limits 4 < Q < 17 
und 0 < qj < 17 follow from the nature of the Thorn 


dike-Lorge tables. The number of possible arrange 
ments of the four component frequencies that satisfy 
these conditions for given Q may be denoted by Ag, 
and the number of these arrangements that assume 
a frequency m in the component representing the 
magazine count, Am)g. The probability that a word 
with a combined frequency Q has a magazine-count 
frequency m may be estimated from the ratio Amig/Ag 
Suppose Ng words are listed with combined frequencies 
of Q in the Thorndike-Lorge tables. Then ttnig = 
Ne(Amiq/Ag) estimates the number of words with 
combined frequencies of Q that have magazine-count 
frequencies of m. The estimated number of words with 
magazine-count frequency m that are omitted from 
the published Thorndike-Lorge tables is therefore 


bs 7 ’ 
tm = Lotmig = ZeNolAmie/Ag 


basis to the estimated magazine-count fre- 
quencies. In this way the words omitted from 
the published tables can be assigned values of 
F(j) and 2/7). These data can be used to cor- 
rect Fig. 1. 

The 95th percentiles of the Z/,(7) distribu- 
tions for these corrected data are shown by the 
open circles in Fig. 3-A (no correction is indi- 
cated for magazine-count frequencies of 12-17 
where the change was not appreciable). The 
corrected data fall quite close to the theoretical 
straight line except for magazine-count fre 
quencies of three or less (log F(7) < 0.5). In 
that range the correction is only partial be- 
cause the published tables do not report words 
with combined frequencies of less than 4. The 
partial correction has a large enough effect, 
however, to indicate that complete correction 
would eliminate the remaining deviations of 
these points from the rectilinear regression 
line. 

Variation of base probability with base associa 
tive probability constant. Consider next the con 
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ditional distributions in the horizontal dimen- 
sion of Fig. 1, which show the proportion of 
words with a given frequency in the Kent 
Rosanoff tables that occur 0, 1, 2, 
Figure 


times 
shows 


2-B 
several of these distributions on logarithmi 


in the magazine count 


normal coordinates. In this case the sets of 
points do not describe parallel straight lines as 
2-A. Two artifacts, 
these data: the omission of words with maga- 
than 18, 


scribed in the preceding paragraphs; and the 


in Fig however, affect 


zine-count frequencies of less de- 
reversal of the regression line for log Z/,(7) on 
log F(7), shown in Fig. 3-A. When the effects 
of these two factors are taken into account, the 
log prob 


indicated by the straight lines in Fig. 


theoretical distributions of base 
ability, 
2-B, turn out to be normal and homoscedasti 
rhe first factor is responsible for the leveling 
off at the lower ends of the distributions of 
Fig. 2-B (the effect does not show up in the 
distributions because they 
contain affected 
has been pointed out that words with combined 
frequencies of less than 18 in the four word 


counts of the Thorndike-Lorge tables had to 


two right-hand 


no points in the range). It 


be assigned a frequency of zero in Fig. 1 a! 
though their true magazine-count frequen 1eS 
This produces an excess 


may be as high as 17 


of words in the interval F(7) 0.5 and a cor 
responding deficiency in the interval 0.5 

F(}) 
2-B, therefore, 
(—0.3 on 
their 
17.5 (+-1.2 on the logarithmic abscissa) 
rhe 


scribed in connection with Fig 


17.5. In the cumulative plot of Fig 


for F(7) 0.5 


abscissa) lie wel! 


the points 
the logarithmic 
above true values, while those for 
F(4) 
are correct correction procedure de 
3-A eliminates 
this type of deviation altogether 

rhe factor, the 
gression Fig. 3-A, 


at the upper ends of the dis 


the re 
s responsible for 


ecom reversal of 
line of 
the deviation 
tributions of Fig. 2-B. The effect of the reversal 
is to lower the positions of points at the ex- 
treme right of Fig. 1 from the positions they 
would occupy if the regression line of Fig. 3-A 
At log 
F(j) 3.0 the median point is lowered about 
0.1 log log F(j 
unit; and at log F(j 

a horizontal slice through the 


were rectilinear over its entire range 


unit; at 3.5, about 1 log 
4.0, about 2 log units. 
Consequently, 
upper part of Fig. 1 has a large proportion of 
> 3.0 removed 


A dis- 


its points in the range log I’(7) 
by the reversal of the regression line 





tribution of log F(7) at this level therefore 
has the form of a truncated normal! distribution 
(2, §6.9). Distributions of log F(j) taken at 
successively lower levels of Fig. 1 have pro- 
portionately fewer points removed by the re- 
versal. The degree of truncation should de- 
crease accordingly. At the very lowest levels 
of Fig. 1, in fact, the reversal actually adds 
more points from higher levels than it removes 
to still lower levels. The net effect at these 
levels is thus to add a small number of points 
in the affected range. 

In Fig. 2-B, distributions of log F(j) for 
successively lower levels of log =/,(7) are ar- 
ranged from right to left. All six sets of data 
can be brought into accord with the theoretical 
straight lines by replotting them for the fol- 
lowing assumed degrees of truncation (read- 
ing from right to left): 55%, 35%, 18%, 9%, 
0%, and —10% (negative truncation indicates 
the addition of points instead of their removal). 
For each set of data the degree of truncation 
that gives the best fit to the theoretical line is 
of the order of magnitude required by the data 
of Fig. - 

The medians of both empirical and theoret- 
ical distributions are used in Fig. 3-B to indi- 
cate the 
quency on summed Kent-Rosanoff frequency. 
Solid circles represent medians of the empirical 
(truncated) distril the 
medians of the tiueoreticai (normal and homo- 


regression of magazine-count fre- 


utions, open circles 
scedastic) distributions. (For log =f,(7) < 1.3 
the solid circles represent both sets of medians. ) 
Each set of data is well described by a straight 
line, the slope for the empirical medians (solid 
line in Fig. 3-B) being slightly steeper than the 
slope for the theoretical medians (dashed line). 

Since the the dis- 
tributions have to be estimated the 
theoretical lines of Fig. 2-B, their accuracy is 


medians of theoretical 


irom 


open to question. As a test of their validity, 
the slope of the line fitted to the theoretical 
medians can be estimated by a second pro- 
cedure. The truncation of the empirical dis 
tributions of log F(7) does not extend to the 
lowest fractiles. Hence a line passed through a 
fractile of the empirical distributions 


should have the same slope as the dashed line 


low 


of Fig. 3-B representing the medians of the 
theoretical distributions.? The line of best fit 
to the 25th percentiles of the empirical dis- 
tributions has a slope of 1.45 + 0.1, compared 
to 1.40 + 0.1 for the dashed line of Fig. 3-B. 
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By way of comparison, the dashed line of Fig. 
3-B, representing the medians of the empirical 
distributions, has a slope of 1.85 + 0.1. 
Correlation between base probability and base 
associative probability. Only a small proportion 
of the words represented in Fig. 1—perhaps 
0.5 %—are involved in the reversal of the re- 
gression line in Fig. 3-A. The behavior of all 
other words is described by the rectilinear re 
3-B (dashed 
line). For these data, which satisfy the assump- 


gression lines of Fig. 3-A and 


tions of a normal correlation surface (2, p. 
585), the product-moment correlation coeffi- 
cient p can be used to measure the degree of 
association between base probability and base 
associative probability. Numerical computa- 
tion of p is impossible because of the presence 
of infinite scores. The correlation can be deter- 
mined from the angle 6 between the two re- 
gression lines, however, since 


p }0102 


tan 6 > 

ploy + a) 

(2, Equation 19.7.12). 

6 is 3.5 degrees (Figs. 3-A and 3-B), a; is 1.10 

log units (Fig. 2-A) 

(Fig. 2-B). The corresponding estimate of p is 
+-2).937 + 0.001. 

This correlation must 


For the present data, 


and az is 0.75 log unit 


’ 


be interpreted in the 


light of several adverse factors affecting the 
data: the difference of 30 years between the 
dates of the two sets of data; the different 
composition of the two populations of sub 
jects; the contrasts between the written lan 
guage of the magazine count and the spoken 
language of the Kent-Rosanoff 
the small number of stimulus words tested in 
the Kent-Rosanoff study; and the use of the 


experiment; 


approximation to base associative probability 
All these factors 
can be expected to introduce uncorrelated 


instead of its exact definition 
variance into the results. The true correlation 
between base probability and base associative 
probability, uncontaminated by these sources 
1.0 


Stochastic equivalence thus appears to apply 


of error, must therefore be close to 
to the words considered in this section 
The reversal of regression at high magazine- 


The 


magazine-count 


count frequencies words having the 


highest frequencies are the 
ones that are responsible for the inversion in 


3-A. Table 1 lists the 


25 words having the highest magazine-count 


the regression line of Fig 
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TABLE 1 


SUMMED ASSOCIATIVE FREQUENCIES FOR THI 
CLASSES WITH THE HIGHEST MAGAZINE 
FREQUENCIES 


25 Worp 
OUNT 


F(j) 


Jenkins 
= 19,100 


Magazine Count 


Kent-Rosanoff 
N = 4,591,125 i 


= 100,000 


the 236472 0 28 

and 138672 0 

a 117222 0 

to 115358 0 

of 112601 

I 89489 

in 75253 

was 58732 

that 55667 

it 52107 

he 19268 &2 

you 42581 219 

for 39363 54 

had 34341 42 

is 33404 135 

with 32903 42 
17. her 31824 60 
18. she 31087 70 
19. his 30748 104 
20. as 30693 10 
21. on 30224 o4 
22. at 26250 6l 
23. have 24456 19 47 
24. but 23704 0 27 
25. me 23364 14 242 


535.783 54 
$3451 0.00054 


1943 
0.10171 


Sum 
Sum/N 0 


frequencies along with their summed Kent 
Rosanoff frequencies. Without exception, the 
high magazine-count frequencies of these 
words are matched very 
Kent-Rosanoff frequencies average 
K ent-Rosanoff only 
0.2% of the average magazine-count frequency, 


the difference in size of 


low summed 


The 


frequen y 1S 


with 
summed 


after correction for 
sample 

Study of Table 1 discloses that all the words 
listed there are of a characteristic type. They 
are words whose primary function 1s to connect 
other words in continuous discourse: artic les, 
conjunctions, auxiliaries, prepositions, pro 
nouns. These may be called interstitial words 
The low frequencies of such words in the Kent 
Rosanoff 


grounds 


tables could be expected on two 


First, the subjects in the association 
experiment are instructed to emit only one 
word utterances. The absence of continuous 
discourse eliminates the principal function of 
interstitial words, thereby reducing the num 
ber of occasions on which they are called forth 


Second, it has been well established that sub 
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tend to give associations of the same part 
(10, p 347) 
Phe complete absence of interstitials among the 
100 Kent-Rosanoff stimulus words would there 
fore result in low frequencies for those words as 


ject 
of peech as the stimulus word 


associations. The former interpretation of the 
result implies that interstitial words represent 
a special type of word to which the property of 
stochastic equivalence does not apply, while 
the latter interpretation holds that the excep 
tional behavior of these words is merely an 


artifact of the particular association 


experi 
ment that has been analyzed 
The best evidence on the question is found 


in a recent association experiment by Jen- 


kinst in which over 50 interstitials, including 
all but three of the words in Table 1 (in, her, 
as), were among the 200 stimulus words. Table 
1 shows the frequencies, summed over all 200 
timulus words, with which the words listed 
there occurred as responses in the Jenkins 
tudy. The effect is striking: there is a 200 to 1 
increase in frequency over the Kent-Rosanoff 
experiment. The 25 words of Table 1, while 
they constitute only 0.05% of the Kent-Rosa 


noff responses, make up 10% of the Jenkins 


responses and 33% of the magazine count. 
Inclusion of interstitial words in the list of 
timulus words thus goes a long way toward 


closing the gap between the summed associa 
tive frequencies and the magazine-count fre 
quencies of interstitial words. Use of the com 
plete lormula for base associative probability 
nstead of the approximation would narrow the 
gap even more 


Phe 
than 


Jenkins experiment, based on fewer 


100 subjects, is not extensive enough to 


warrant a more elaborate analysis. But it shows 
clearly that the particular selection of stimulus 
words used by Kent and Rosanoff is chiefly 
responsible for 
Fig. 3-A 
between the Jenkins result 
Rosanoff further 


tochasti 


the reversal of the regression 
The sharpness of the contrast 
and those of Kent 


the possibility 


line uu 
and suggests 
that 


interstitial we 


equivalence may apply to 


rds as well as to noninterstitial 


ones. A large and carefully designed experi 


Jenkins of the University 


Mmcing his 


‘]T wish to thank Dr. J. ] 
of Minnesota 


disposa! and 


for generously lata at my 
Miss Evelyn Segal for preparing the com 


putations 


ment would be req ‘red, however, in order to 


test this possibility adequately. 


SUMMARY AND CONCLUSIONS 


The summed associative probability of a 
word, measured by its total frequency in the 
Kent-Rosanoff tables, is considered in relation 
to the probability of emission of the word in 
general linguistic usage, measured by its 
frequency in the Lorge magazine count. Two 
features stand out in the data: a high positive 
correlation (.94) for magazine-count frequen- 
cies of less than 800; and a sharp reversal in this 
fre- 
quencies. Perhaps neither result, taken singly, 
is as striking as the fact that both appear so 
clearly in the samé set of data. 

The inverse relationship for high magazine- 
count frequencies is produced by a fairly 
small number of words of a special type, 
called interstitial their chief 
function is to connect other words in continu- 
conjunctions, articles, 


relationship for higher magazine-count 


words because 


ous discourse (i.e., 
prepositions, auxiliaries, pronouns). Such words 
were systematically excluded from the list of 
stimulus words used by Kent and Rosanoff. 
As a consequence, they were rarely emitted as 
associations despite their high magazine-count 
Data from association 
experiment, in which stimulus- 
words were included, indicate that the reversal 
of correlation found in the Kent-Rosanoff data 


can largely, if not completely, be eliminated by 


frequencies. another 


interstitial 


an unbiased selection of stimulus words. 
The high found for all 


words indicates that the average probability 


correlation other 
that a given word will be emitted as a response 
in the word-association experiment is the same 
as its probability in general discourse. The 
language of association can therefore be said to 
be stochastically equivalent to the language of 
general discourse. This relationship implies 
that a person’s selection of words is not affected 
by the artificial laboratory situation required 
for the Hence the 
effect of a given stimulus word upona subject’s 


association experiment 
response must be due to a redistribution of the 
set of factors governing emission of words in 


The 


association 


general discourse results thus serve to 
the 


method for studying under laboratory control 


validate experiment as a 


the processes governing the emission of words 
in general discourse. 





to 
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LOOKING AT OCCUPATIONS! 


WILLA FREEMAN GRUNES 


University of California, Berkeey* 


ow does the young American look at 
jobs? What groups of jobs appear in 
psychological field? What do 
these groups look like to the perceiving indi 
Do different these 
things differently? These are phenomenologi- 
cal questions, of the sort MacLeod (9) meant 
when he suggested that the phenomenological 
fruitful 
sical problems in perception, be applied 


his 


vidual? social classes see 


method, found so in approaching 
Cid 
also in the field of social psychology. They con- 
cern the social world which is psychologically 
“there’’ for the subject, quite apart from what 
the experts might say is, or ought to be, there. 

There have been few studies that attempt to 
answer such questions about aspects of the 
social world by attempting to discover per- 


Methods 


ich investigation are woefully lacking and 


ceived units and their characteristics 
jor 
dificult 


ented as suggestive of new ways for mapping 


to invent. The present study is pre 
perceived social worlds 
If we can determine what people see, we can 


think 


upations, we 


better understand what they and do 


} 


Specifically, in the realm of oc 


can better see why they make their particular 
vocational choi es, and why too many choose 
certain types of overcrowded fields while too 
few choose others 

Occupational perceptions were selected for 
this study partly because they seem important 
for revealing the psychological meaning of 
social class memberships. Such meaning has 
long been neglected in research on class in 
America. These perceptions also offer a meeting 
place between psychology and economics and 
hence offer a promising area for mutual enrich- 


ment of both fields 


' The material discussed is adapted from a doctoral 


dissertation (6) done under the guidance of Professor 


Mason Haire. Especially extensive assistance was given 
by H. D. Mugass, R. R. Lange, W. ( Rasmussen, 
Dorothy W. Gottlieb, Katherine M. Cavanagh, Leon 
Festinger, and Camille J. Miller, as well as by the 


administrators and their staffs of the schools involved 
*? The author now resides in Mandan, North Dakota 


bate) 


METHOD 

Sample 

About 150 students in each of eight high schools 
tested in 1952-53. Four Northern California 
schools were included: Oakland, McClymonds, San 
Lorenzo, and Hayward. There were also two South 
TaJledega in Alabama and Orangeburg 
one Northwest Central school: 
Dakota; and one Northeastern: 
The communities of 


were 


eastern schools 
in South Carolina; 
Bismarck North 
Clifton Heights in Pennsylvania 
the school vary widely in size, social class, race, and 
in degree and type of industrialization. The diversity 
seemed desirable in order to assess the generality of 


in 


the findings 


Proce edure 


Two group-administered devices, the Grouping Test 
and the Pick-A-Job Test, were developed and pretested 
for purposes of revealing the way perceived occupations 
tend to be grouped by high school the 
perceived structure of the world of jobs. Both were in 
directly inspired by various studies by Krech [(8) and 
recent unpublished work] on the way rats and also 
In order to make 


students; i.e., 


men structure the world around them 
it easy and threatless for subjects to reveal their per- 
both The 
subject is provided with an incomplete situation, out 


ceptions, devices are disguised on purpose 


f which he must make some sense of his own in order 
Such and 


to deal with it a disguised unstructured 


approach seems appropriate to an area of content in 
which subjects may be unable or unwilling to verbal 
ize what they Through these two devices, a 
perceptual structure was sought which could be re 


per elve 


garded as common to most high school students. The 
that all 
possess a similar structure, 


assumption was nevertheless avoided social 


classes—or all individuals 
or that any perceived structure necessarily reflects ob 
jective social reality. In this respect the method differs 
from that by Lloyd 
(10) to study class structure 

Grouping Test. On the Grouping Test, the subject 
was presented with a list of 51 varied occupations, 
selected to represent each of the major categories in 
(11). He was 
(somewhat 


used Warner and his associates 


the Dictionary of Occupational Titles 
given the following written 
abridged here) along with some oral clarification 

Let us see how fast you can think about various 
Group the following occupations as 
many ways as you can For example, make groups 
according to what kinds of people work at the 
of work they do, etc. Give each 
tells what kinds of jobs belong 
in each 


instructions 


occupations 


jobs, what kin 
group a title tha 
it At 


you make 


under least two jobs must be 
group You may use the same jobs 
under as many For example 


People who would probably know how to give 


titles as you wish 





of 
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first aid: airline stewardesses, doctors, nurses, 
physical education teachers 
The more groups you make within the time 
limit (15 minutes) the higher your score. You can 
use any types of groups you can think of Be 
sure to include all of the occupations on the list 
that belong under each of your titles, because you 
also get a higher score for every job you list that 
belongs there. Here is the list of occupations 
The resulting data were tabulated and coded to 
indicate the jobs which tend to be placed together in 
the same groups, the grouping categories reflected in 
the titles, and the jobs most frequently placed undei 
each type of title. A set of job clusters common to 
most students’ view of the world of occupations was 
discovered. Content analysis of the titles was done by 
two coders working independently, using only cate 
gories which yielded at least 80 per cent agreement 
Since completing the present study, the author has 
learned of a similar effort by Campbell (1) to develop 
a method for establishing subjective occupational 
groupings. His study failed, however, to reveal any 
common job clusters. The present method differs from 
his in permitting the subject to use each occupation 
in as many groups as he wished and to make as many 
(overlapping) groups as he could, as well as in the 
procedures used to analyze the data. These provisions 
appear more appropriate than Campbell’s to the pre 
f 


sumably complex nature of the perceived structure « 
the occ upational world 
Pick-A-Job Test. The Pick A-Jop Test offered the 
subject priel lescriptions of six different people and 
asked him to assume the role of vocational counsellor 
and to suggest five appropriate jobs for each one. For 
example, the first description appeared thus 
\ Man 
good-looking 
ciean 


well-educated 


Ae wh 


sell-conhdent 
Results were analyzed to indicate the typical sorts of 
jobs suggested for each des niption 
I'he personality descriptions used on the Pick-A-Job 
Test were derived from earlier use of what may be 
called the Haire Describe-A-Person method, a modifi 
cation of a procedure originally developed by Haire 
and described by Haire and Grunes (7). High school 
students were given a few rather neutral facts about a 
fictitious person, and also his occupation, and were 
asked to guess the sort of person he was. Their gucsses 
were coded for content and compared with des niptions 
obtained from similar subjects when the occupation 
was oniitted from the given facts. Differences between 
the two sets of des riptions re vealed ste reotypes asso 
ciated with the occupations selected for study. The 
Pick-A-Job Test was based on the typical descriptions 
thus obtained of businessman (Description 1 above 
ditch-digger, actress, and farmer. Additional descrip 
tions based on the typical businessman and dit h-digger 


were included, with the characteristics ascribed in each 
case to a woman 

tesides these procedures, the Gough Home Index 
(4). a measure of relative social status of high school 


students, was administered. It yields a continuous dis 
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rABLE 1 
SoctaL CLass DIFFERENCES IN FREQUENCY OF 
Groupinc Orner Occupations Witna Doctor 


Entries represent percentage of groups involving 
“doctor” made by each social class) 


Home-Index Social Class 


Co-Member Lower Middle Upper 
Nurse 31 66 53 
College Professor 38 59 sO 
Teacher 44 56 73 
Social Worker 13 24 47 
Air Line Stewardess 56 38 33 


tribution of scores, but was here employed to classify 
subjects into three status groups (low, middle, and 
high) by cutting the distribution for each school at 
the two points where the large central frequencies 


began to fall off 


Derivinig the J ob Clusters 


Data were compiled on each job listed on the 
Grouping Test, showing, for each social class, 
the other jobs with which a given job was 
grouped and the types of titles under which it 
was listed.’ As a sample, the following informa 
tion was gathered about the ‘‘doctor”’ 


In at least 50 per cent of the instances in which 
doctor was listed, the following jobs appeared in the 
same group: nurse, college professor, and teacher. In 
at least 40 per cent, the engineer and minister were in 
the same group 

The dish-washer, ditch-digger, farmer’s wife, hired 
man on farm, rancher, and janitor were in the same 
group with doctor less than 5 per cent of the times it 
appeared 

Doctor appeared in at least 50 per cent of the groups 
in which each of the following jobs appeared nurse, 
college protessor teacher, engineer SOK ial worker re 
porter, businessman, and business-woman 

Table 1 shows the percentage of groups made by 
each social class that included both the doctor and 
each of the five occupations listed. Note that the higher 
status subjects tended to group doctor with other pro 
fessionals more consistently than did the lower status 
subject 
The categories of Grouping Test titles unde which 


doctor was listed at least 50 per cent of the times they 


occurred include: educational barrier much educa 
tion, good looking, lot of money, personality barriers 
social skills, ability barriers, brains, verbal skill 


killed,” “‘trained,”’ training barriers 

Doctor was listed under less than 50 per cent of the 
titles categorized as follows: charm, strong, physical 
barriers, little or no education, “‘common labor,” dumb, 


not own boss, little money 


* All the data are available in tabular forr 


author’s dissertation (6) 
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TABLE 2 


A Sampte Jos Ciuster Marrix 


Job Title 


LD Nu Coll 
Co-Members 
Doctor xX +--+ 4 
Nurs 1.4 x 
College Professor + + + + x 
Peacher ++) ++) 4 
Engineer + 
Minister + + + 4 
Social Worker 
News Reporter 
Businessman + 
susiness Woman } 
Air Line Stewardess + 
Phys. Ed. Teacher 
Nonme nboers 
Dish Washer 
Ditch Digger 
Farmer's Wife 
Hire Man Farm 
Rancher 
Married Woman 
Welder 
Pruck Driver 
Maid 
Housewife 
Janitor 
Garhbs ge Ce llector 
Factory Worker 
Farmer 
Cleaning Woman 
Note means that 50 per cent « 
same group as the occupation at the left of the row 
+ means that such co-grouping o« 


tion at the to 


meat hat the occupation at the left appeared in fewer than 
* In the ca 


Data for 


se of news reporter, businessman 


here the other occupations are complete 


From the above data, it was predicted that 


the first Pick-A-Job description, as given above, 
would elicit the suggestion of doctor, among 


others. Actually, although the students had the 
whole world of occupations from which to 
choose, and five different choices to make, 


doctor accounted for fully 10 per cent of all 


jobs chosen for that description 

Similar data for each job were combined to 
define and characterize the commonly per 
ceived job clusters. Table 2 gives, for example, 
the matrix of relationships which led to the 
definition of the cluster (Cluster A) to which 
the doctor belongs. The data presented also 
provide some evidence that Cluster A overlaps 
with another cluster 


4 complete matrix, similar to Table 2 but 


including all 51 jobs on the Grouping Test, was 


used in determining the commonly perceived 
clusters.‘ Unless an occupation was grouped 

‘TI atrix composes Part IV of the Appendix of 
the « ertation, pp 307-316 (6 


and business-woman 


CLuster A 
Teach Eng Min Social News Busm Busw 
++ ++ ++ + + +4 +4 4}. -4. 
+ + + ++ + + + + 
+ + + + + + + ++ ++ + + 
x t+ | ++ | ++ +4 + $+ 
x + 
+ + xX ++ + + 
+ + X 
+ + xX + 
1 t + t xX ++ 
+ + + + + X 
+ + + + +. 
+ + etc.* etc.* etc.* 


r more of the times that the occupation at the top of the column was listed, it appeared in th® 


urred in at least 40 per cent, but not as many as 50 per cent of the total listings of the occupa 


per cent of the groups in which the occupation at the top occurred 


associations were listed 


found with additional co-members not 


with at least one member of a cluster at least 40 
per cent of the time, it was not considered to 
be even a borderline member of that cluster. 
To be considered a clear member, it had to be 
grouped at least 50 per cent of the time with 
most of the other clear members of the cluster. 
A number of jobs were found to be border- 
line or clear members of two clusters, and these 
jobs were taken as evidence that the clusters 
overlap. No jobs were found to be even border- 
line members of more than two clusters. 
Confirmation of the clusters thus derived 
was found in the “nonmembers,” which always 
camefrom “‘distant’”’ (nonoverlapping) clusters, 
and in the data from the Pick-A Job Test. 
Students tended to choose jobs from a highly 
restricted set of clusters for each des« ription on 
the Pick-A-Job Test. These clusters were found 
to be predictable from a knowledge of the 
cluster membership of the job the stereotype of 
which had been drawn upon in formulating the 
description. For example, Description 1 (given 


t 
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above) elicited almost exclusiveiy members of 
the cluster (A-B) to 
belongs and members of the twoclusters (A and 
B) with which A-B overlaps 

Implicit crileria in grouping. Characteristics 
of the clusters were determined by an analysis 
of the type of titles under which the cluster 
members were most frequently grouped—or 


which “businessman’”’ 


not grouped—on the Grouping Test. As shown 
by the types of titles they choose for their 
groups, students typically show consciousness 
of certain dimensions in thinking about jobs, 
most notably: barriers to attaining the jobs 
(especially educational barriers), status, skill 
and knowledge, and the nature of the work. 

The empirical job clusters. The overlapping 
job clusters and the outstanding characteristics 
ascribed to them, as revealed by the Grouping 
Test and confirmed by the Pick-A-Job pro 
cedure, are as follows: 


Cluster A. These jobs are characterized as requiring 


much education, ability, brains, social and verbal skill, 
they are almost never 
Included from the 


minister, doctor, 


and as yielding much money: 
seen as requiring physical strength 
college 


list provided are professor, 


nurse, teacher, and less clearly engineer, social worker, 
and businessman 


These jobs have characteristics similar 


reporter 

Cluster A-B 
to those of Cluster A, but 
educational barriers. They 
and business-woman (also a member 


with much less stress on 


include businessman (also 

a member of A 

of B) 
Cluster B 


requiring verbal skills. They 


These jobs are seen as “business” and as 
include business-woman 
office-worker, secretary, stenographer, bank clerk, and, 


less clearly, salesman, saieswoman, and store clerk 
Cluster ( 


subclusters: C 


These jobs are seen as divided into two 
; 


1, characterized by skill and strength, 


and C-2, seen as hard work and “common labor” done 


by a worker who is strong and not his own boss. C-1 


includes carpenter, mechanic, and welder; C-2 the 


farmer, rancher, hired man on a farm, truck-driver, 
and less clearly, ditch-digger 

Cluster C-D. These jobs are seen as involving hard 
work, “common labor,” and as requiring little educa 
tion or anything else except strength. The workers are 
seen as stupid and as not their own bosse rhey in 
clude ditch-digger (also a member of C), garbage 
collector, and janitor 

Cluster B-D. These jobs are seen a business” and 
as requiring social skill. They include salespeople and 
store clerks (also in Cluster B) and also waiters and 
waitresses 

Cluster D. These jobs are en as requiring little 
(not even strength), as “common labor,” and hard 


work, and their holders as stupid people of little educa 


tion who are not their own bosses. They include dish 
washer, cook, maid, housewife, married woman who 
stays home and kee ps house, cleaning woman and, less 
clearly, elevator operator, waitress (also a member of 


B-D), and janitor (also a member of C-D 


Certain occupations on the list could not be 
placed in the above system of interrelated 
clusters, including one rather weak group 
including actresses, airline stewardesses, and 
models, and a strong one including only police- 
men and firemen. Jobs which seemed to stand 
completely alone were factory worker (dis 
tantly related to C-D and D), aviator, musi 
cian, physical education teacher, farmer’s wife, 
tailor, barber, foreman, railroad conductor, 
and baseball player. A still more exhaustive 
list of occupations might well reveal further 


relationships. 


Prestige Hierare hy 

In the characteristics ascribed to the ciusters, 
the prestige hierarchy inherent in the common 
cognitive structure is revealed, with positive 
and barrier-type characteristics both tending 
to be concentrated in the A and A-B Clusters, 
and least in Cluster D and the clusters over 
lapping it. Physical strength is an interesting 
exception to this rule. This prestige hierarchy 
agrees well with the conclusions of other in 
vestigators (2, 3) have asked 
samples of Americans to rank lists of jobs 


who various 
according to their ‘‘social standing.”’ 

here is a large gap between the perception 
of professionals, business, and office people 
(Clusters A, A-B, and B, but not B-D) on the 
one hand, and manual! and mechanical workers 
on the other, with only relatively rare cases 
occurring in which they are seen as members of 
the same group. Some of the evidence for this 
deep gap may be seen by studying the ‘“‘non 
All of 
extremely rare grouping with Cluster A occupa 
tions are from Clusters C, D, and C-D. The 
gaps between skilled and unskilled workers and 


members” in Table 2 these cases of 


between clerical and professional workers are 


not nearly so wide 


Stereolyped Job Choices 


Students agree remarkably well on the o« 
cupations and clusters of occupations seen as 
appropriate for the people described on the 


Pick A-Job lest their 


hoices are sharply limited to a few 


For each description 


Salient 


possibilities out of the thousands of occupa 


tions from which they were free to choose. In 
answer to Description 1, for example, the fol 
lowing job categories accounted for about 70 
per cent of all the suggestions: salesman 
manager, businessman, teacher, doctor, lawyer, 


and “‘oflice worker.” 
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TABLE 3 
Occupations Most Commonty Groupep Wirtn 
CoLLece Prorgessor ON THE GrouPpinGc TEST BY 
SociaL CLAss 


Percentage of groups 
which contain both the 
occupation listed and 


Co-Member college professor 


Middle 
Class 


Upper 
Class 


Lower 

Class 

Doctor 53°) 70% 75% 
Minister 33 53 63 
Teacher i 73 68 81 
Engineer / 6) 44 58 
Businesaman 50 47 44 
Business- woman @ 43 42 
Office worker 33 26 17 

Note Few of the differences in percentage shown are sta 

tistically significant, but they seem important because of their 


Those for college professor are typical of those for 


consister 


all other Cluster A occupations. There were no relevant cases 


showing an opposite trend 


For Description 2—a man who is: hard 
working, strong, not very bright, and likes to 
mainly choose jobs 
C-D, which 
occupations. The 
with the same traits is steered mainly toward 
Clusters B-D and D 


tions, however, the sex of the person described 


follow orders——-students 


from Clusters C and contain 


mostly masculine woman 


For the other des« rip 


changes the particular jobsselected, but not the 


clusters from which they come 


Group Differences in Perception of Occupations 


Although there is consider 
able the 
common pattern of clusters, there are some 
cognition of the 
notably, students 


( las 
agreement 


differences 
among students about 
differences in 


Most 
from the upper classes tend to make a sharper 


Son ial ( lass 


oct upational world 


distinction between business and professional 
people than do the lower classes. The latter 
tend to lump Clusters A and A-B and B into 
one high-prestige cluster. They also tend to 
exclude from that cluster certain occupations 
otten placed there by upper classes such as 


the social worker, minister, engineer, reporter, 


and musician. Table 3 presents a sample of the 
evidence for these conclusions, using the or 

cupations grouped with college prolessor on the 
Grouping Test as examples. Each value in the 
table represents the percentage of the total 


number of groups, made by the subjects in a 


particular so ial class. which contain both the 


occupation at the left and “college professor.” 
Note that the college professor 1s grouped with 


other professionals more consistently, and with 
business people less frequently, as status rises. 
(See also the data on doctor listed above.) In 
general, the upper classes seem to be more 
finely differentiated about the upper occupa- 
tional strata. There is also some less clear evi- 
dence that the lower classes are more differ- 
entiated about the lower prestige jobs. 

The class differences in characteristics at- 
tributed to particular jobs follow lines predict- 
able from the class differences in the job 
clusters themselves. Subjects from the lower 
classes tend more often to include jobs from 
Clusters A-B and B—as well as A, which all 
classes stress—when listing jobs that share 
certain positive and barrier-type character- 
istics, especially those concerning education 
and ability 

Certain particular occupations on the Group- 
ing Test list were placed in different clusters by 
different classes. The clearest example is the 
interpreted as a professional by 
skilled mechanical 


“engineer,” 
upper 
worker by lower 

These class di 
herent patterns in the data, but could not 
feasibly be subjected to statistical test. The 
group differences described below, except where 
otherwise noted, were found to be significant 
at the .05 level of confidence 

Subjects higher in social status appear to be 
relatively jacking in both status and barrier 


classes and as a 


f 


” ' ; 
erences are evident in co- 


consciousness, but relatively strongly conscious 
of skill and knowledge, pace of work, and 
money. It may be that students are less aware 
of the job aspects about which they feel less 
insecure. The fact that students whose bread- 
winners are in manual or mechanical work are 
relatively less conscious of strength seems to 
support this inference 
The Pick-A Job Test 
greater tendency of the lower classes not to 


differentiate the top three job clusters. While 


results reflect the 


higher status subjects restrict themselves 
closely to professionals and business people for 
both the male and female version of Descrpi- 
tion 1, clerical occupations are often selected 
by lower classes. For example, clerical work is 
suggested for the male version by 31.3 per cent 
of the lower-class subjects, by 15 per cent of 
the middle class, and by only 2.6 per cent of the 
upper class (difference significant at the .01 


level). For Description 2, upper classes tend to 
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name more stereotypically low-prestige jobs 
like the garbage-collector and ditch-digger 
perhaps again indicating their coarser distinc- 
tions among lower-strata occupations. 

Sex differences. The study revealed some out 
standing differences in the way boys and girls 
see the world of jobs. Through the titles they 
used on the Grouping Test, girls showed more 
awareness—or at least more articulateness 
concerning all the common types of job aspects, 
with the exceptions of security, money, and 
work. If the female is more status 
conscious, as has been claimed, she is also more 


hours of 


conscious of a large number of other perceptual 
dimensions of jobs 

Sex differences were also found in the answers 
to Description 1 and its feminine counterpart 
on the Pick-A-Job Test, with the girls showing 
a stronger tendency than boys to name teach 
ing jobs for both versions. Boys tend to select 
clerical rather than professional jobs for the 
female version, while girls choose mostly pro- 
fessional jobs, as do both sexes for the male 
version—perhaps a sex difference in willingness 
to limit capable fernales to lower social position. 

Regional differences. In _ broad 
students in all eight schools see the same world 


outlines, 


of occupations. But consistent trends in the 
data indicate that those in the large industrial 
communities like higher 
to see a relatively sharp profes- 
Students in the 
their 


ized tend, status 
subjects, 
sional business distinction. 
Southeastern 


Grouping Test titles, to be more conscious of 


schools appear, through 
status and education than the others tested; 
but 
similarity in the types and relative potencies of 


otherwise there is considerable regional 
dimensions used for categorizing occupations 
Ihe social worker is the outstanding example 
of a job that is grouped differently in different 
regions: it is much more clearly professional in 
the large Northern cities. 

Despite the class, sex, and regional differ 
ences mentioned, in general the students seem 
to agree rather closely in their basic perceptual 


structure concerning occupations 


DISCUSSION 


The Grouping Test technique, in which a 
collection of items is set before the subjec t lor 
him to record the groups or units that he per 
ceives spontaneously, is easily adaptable to 


many problems of social perception. Instead of 
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occupations, one could list, for instance, names 
of political figures, nationalities, religions, 
brands of food, 
periodicals. The general method of the Pick-A 
Job Test is likewise adaptable. Since it in- 
volves giving the subject hypothetical group 
characteristics and asking him to name mem- 
bers of the group, it could be called the Pick- 
The-Members technique. In other adaptations, 
subjects could be asked to name, instead of 


jobs, clubs to which the described individual 


business corporations, or 


might belong, public policies he might be 
likely to favor, movies he might enjoy, etc. 
The Haire method (Describe-The-Person or 
Describe-The-Member), used here in 
junction the Pick-A-Job Test, seems 
generally adaptable for yielding hypotheses 
about the characteristics of perceived groups. 
Together the two methods can be used to con- 


con- 


with 


firm Grouping Test data about the composi 
tion and characteristics of the perceived units. 
If the conclusions from the Grouping Test are 
valid, use of the Pick-The-Members technique 
on the same subjects should yield the members 
of the appropriate Grouping Test cluster, 
when that cluster’s characteristics are used to 
form the description on the Pick-The-Members 
Test. And the Describe-The-Person 
should tend to yield the characteristics found 
through the Grouping Test for the group of 
The 


various methods were used in this manner in 


method 


which the Person is seen as a member. 


the present study to complement and validate 
each other. 
Some of the implications of the study for 


vocational guidance are discussed elsewhere 


(S) 


SUMMARY 


A phenomenological study of the way 
American high school students perceive occu- 
pations is presented. A perceptual structure 
consisting mainly of seven overlapping job 
clusters is found to be characteristic of most 
subjects. Class and regional differences in the 
the different at 


tributes ascribed io each cluster are examined 


structure are analyzed and 


[wo original types of methods are used which 
are adaptable to many other problems in 


social perception 
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AGE DIFFERENCES IN THE PERCEPTION OF CLOSURE! 
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Institute for Psychosomatic and Psychiatric Research and Training, Michael Reese Hospital 


HE changes associated with aging are 

many and varied. Certain alterations in 

sensory acuity, speed of response, and 
performance on motor, memory, and selected 
intellectual tasks have been reasonably well 
established both by common observation and 
controlled experimentation (6, 8, 17). Yet the 
number of investigations concerned with more 
complex perceptual functioning in older per- 
sons has been surprisingly small. Of particular 
interest are comparative studies of Rorschach 
protocols typifying groups of differing ages 
and their implications for distinctive aspects of 
personality organization (3, 4, 5, 11, 18). 

In the course of a larger research project 
dealing with psychological and neurophysio- 
logical correlates in aging, some provocative 
findings emerged related to a more circum- 
scribed type of perceptual ability—that of 
closure. It is the purpose of this paper to con- 
sider this material. More specifically, the data 
to be presented involve the performance of two 
groups widely separated in age on two kinds 
of perceptual tasks which earlier factorial 
studies have claimed to reflect “the ability to 
fuse a perceptual field into a single percept” 
(21) and “flexibility of closure” (20). Within 
a broader context, this study is conceived as 
an attempt to enhance our general understand- 
ing of cognitive functioning in senescence 
Related investigations are reported separately 


(14, 15) 


METHOD 


I'wo populations were sampled: a young” group of 
16 doctors and nurses con posed of 5 men and 11 women, 
and an “old” group of 16 residents, 8 men and 8 women 
in a home for the aged. The ages of the young ranged 
from 22 to 33, with a mean of 26.8 years; those of the 
old frora 68 to 88, with a mean of 78.1 years. All Ss 
' This study was supported by the U.S. Army under 
Contract No. DA-49-007-MD-261 through the Medical 
Research and Development Board Office of the Sur 
geon General, Department of the Army, Washington, 
D. C. We gratefully acknowledge the aid of Mr. Ben 
Grossman, Director, and the staff of the Drexel Home, 
Chicago, in providing subjects and facilities. Dr. Albert 
Caron and Mrs. Ann Gunther were responsible for 
much of the data collection and Dr. Helen Heath for 


the statistica! analyses 


were physically and mentally healthy and had adequate 
vision, using corrective lenses where necessary. Since 
one of the techniques to be discussed, Concealed Fig 
ures, has been found by other workers to be saturated 
with an intellectual factor (20), the groups were chosen 
so as to be roughly equated for intelligence. They were 
drawn from 24 young and 36 old persons used in the 
total study, to whom a battery of tests had been ad 
ministered which included the Vocabulary subtest of 
the Wechsler-Bellevue Intelligence Scale (22) as well as 
the two perceptual tasks of concern in this paper 
Matching for general intelligence was performed inde 
pendently on the basis of vocabulary level, since it has 
been adequately demonstrated that this index declines 
relatively little in the normal process of aging (1, 2, 10, 
22 The mean and SD of unweighted vocabulary 
scores for the young Ss so selected were 26.8 and 4.6; 
for the old, 28.6 and 4.8, respectively. Although these 
restricted groups enable us to focus most directly on 
influences of age uncontaminated by differences in 
intellectual ability, parallel analyses of the data have 
also been carried out for the larger parent populations 
In all instances the age differences in performance are 
similar in direction and extent to those of the present 
study, at times being perhaps more extreme 

I'wo procedures were used 

Gestali Completion Test. In this task, adapted by 
rhurstone (20) from a technique developed by Street 
19), S is presented with a series of 72 ambiguous 
drawings which he is required to identify. Portions of 
the drawings are obliterated and at first view the 
material appears to consist of diverse and unrelated 
black blotches. Actually the fragments are arranged 
80 as to suggest the outlines of distinct persons, objects, 
and/or situations. The accomplishment of closure in- 
volves organizing the seemingly scattered elements so 
that a compelling and unified whole emerges. In his 
factor-analytic studies of mechanical aptitude Thurs 
tone (21) concluded that the test represented “the 
ability to fuse a perceptual field into a single percept.” 

Following some examples, the instructions were 
“When the signal is given, turn the page and identify 
some more pK tures of the same kind. Work as rapidly 
as you can. /f some pictures are too difficult, skip them, 
and return to them later if you have time.” Scoring criteria 
corresponded to those followed by Campbell and Zaks* 
in recent unpublished studies. Responses were scored 1 
when the activities depicted or all of the persons and 
objects in the picture were accurately described. Half 
credit was allowed when partial integrations occurred 
and these were essentially correct 

Concealed Figures. Thurstone’s (20) version of the 
Gottschaldt Figures Test (9) was employed. Forty-nine 
rows of figures are given, with a simpie geometric design 
appearing at the left of each row followed by four more 
complex designs. The S identifies each of the more 
complex designs in which the simple figure is em 


* Personal communication 
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rABLE 1 
MEAN PERFORMANCES OF YOUNG AND OLD 
(;ROUPS ON GESTALT COMPLETION 
TEST 


Grou Attempted Right Wrong Omitted 

Young 16 63.1 4.7 6.0 12.4 

Old 16 @.7 15.5 15.9 29.4 
bedded. Since the complex figure has “good gestalt”’ its 
properties of closure must be resisted as S searches for 
the smaller figure potentially hidden therein. The 
principal component of this test was termed by Thur 
stone (20 flexibility of closure.” 

In essence the instructions stated: “In every row of 
designs the figure is concealed in at least one drawing 
and may be in as many as all four. Put a check mark 
(./) in the parentheses below a drawing if it contains 
the figure. Put a zero (0) in the parentheses below a 


drawing if it does not contain the figure.” 


Koth procedures were administered in booklet form 


wit! eparate answer heet Testing occurred in indi 
vidual sessions having a time limit of 10 minutes for 
each test. The instructions and illustrative examples 
on the front pase of the booklets were read aloud and 
discussed to ensure that S fully understood the nature 
of the task 


RESULTS 


During the 10-minute test period provided 
F 

for Gestalt Completion, S undertook as many 
problems as possible, some responses being 


correct, some incorrect, and others omitted 
In ‘Table 1 are presented the mean frequencies 
of each type of response for ea h group. First 
to be noted is the lack of any appreciable differ 
ence between the number of items attempted 
by the young and old groups. “Attempted”’ is 
defined by the last item completed 
that both at least 
equivalent portions of the total test in the time 
allotted 

Striking differences are found for the number 
The young 
in contrast 


Thus it 


appears groups examined 


ol drawings correctly perceived 

have a mean number right of 44.7, 
to 15.5 for the old. The ¢ between these values 
is 7.68 which with 15 d/ 


sugyestion of halving the number of degrees of 


following Edwards’ 


freedom in cases of unequal variance (7, p 
170)-—-1s significant at p O01, Equally sharp 
differences are seen on number of items incor 
rectly identified by each group, the mean for 
the young being 6.0, while that for the old is 
15.9. These differences also are highly reliable 
(¢ =: 6.11. df 30, p 001) 
old group has a significantly greater frequency 


Moreover, the 


of failures to respond, 29.4 omissions as com- 


pared to 12.4 for young Ss (/ 5.44, df = WH, 
p < .001). “Omissions” are simply the number 
of items left blank of the total number at- 
tempted. It is therefore quite clear from these 
data that the ability of older persons to achieve 
perceptual closure is distinctly inferior to that 
of young individuals. 

A qualitative analysis of wrong responses 
offers further insight into the characteristic 
functioning of the two greups. Almost invari- 
ably the types of errors committed by young 
Ss consist of inaccurate integrations of the 
entire picture. That is, the disjointed parts are 
fused into a whole, but the closure effected is 
not the intended figure. Sometimes the re- 
sponses given depart but little from those 
credited as correct, and quite often the scorer 
can, as it were, perceive the figure as did S. 
By contrast, older persons usually, though less 
often, incorporate all the fragments, but in 
generally loosely organized and sometimes 
clearly distorted impressions. In addition, al- 
though occurring infrequently, all of the old 
exhibited one or more of the following features 
details of the 
drawings, perseveration of the same response 


in their records: fixation on 
on successive drawings, and a kind of passive 
acceptance of the elements as separates, with 
the last category are 
“clouds,” and “‘flock 
of birds,” which imply a basic deficiency in 


no genuine closure. In 
such answers as “rocks,” 


organizing activity. 

Table 2 describes the results for Concealed 
It is immediate'y apparent that the 
considerably more 
The difference 
far exceeds statistical significance (f 5.14, 
df 30, p 001). However, even within the 
number of items completed the groups differ 


Figures 
young group completed 
items on this test than the old 


significantly in relative adequacy, as measured 
by the ratio of right minus wrong divided by 
total attempted (/ 4.99, df 30, p 001). 
This finding is even more impressive in view of 
the fact that the task seems to be graded in 
order of increasing difficulty. Thus the greater 
number of designs attempted by the young 
necessarily included proportionately more 
difficult items 

An effort was made to determine whether 
the types of error provide any further clues as 
to the differential efficiency of our samples on 
Concealed Figures. However, the data on hand 


allow only a very limited approach to this 
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rABLE 2 
MEAN PERFORMANCES OF YOUNG AND OLD 
GROUPS ON CONCEALED FIGURE 

Not Perceived 

At Perceived Whe 

G \ meatal Rig W » her Not 
Present | Present 

Young 16 91.1 77.3 13.8 10.3 3¢ 

Old 16 55.5 36.9 18.6 13.9 47 


question. A design is scored “wrong” under 
either of two conditions: 1. the simple figure is 
not perceived when present in the complex 
design, and 2. the simple figure is perceived as 
present in the complex design when such is not 
the case. For both young and old the first kind 
of error is decidedly more prominent. More 

over, the findings are similar for the two age 
groups; the proportion of each variety of error 
to total errors is approx mately eqi:al in both 


DISCUSSION 


Perception is conceived to be a resultant of 
an interplay between field and individual fa 
tors. Concerned with demonstrating the im 
portance of closure as one of the major organ 
IZINZ prin iples in perception, pioneer Gestalt 
psychologists (12, 13) employed configurations 
which maximized organizational forces exter 
nal to the person; hence closure was compelling 
and instantaneous In an incomplete but 
“pregnant” field, e.g., a circle with a small are 
missing, the patterns themselves uniformly 
enjoined identical closure. Similarly, with a 
highly organized and stable field, as in Gott 
schaldt’s original experiments (9), the existing 
closure was too coercive to segregate parts 
analytically. In these cases both effecting and 
resisting closure were viewed primarily as 
functions of field organization 

However, where field conditions are less 
coercive the manner and degree of closure is 
more likely to reflect the organizational skill of 
the observer. Consider first the problem posed 
by the Gestalt Completion Test. Here ’s 
task is essentially organizational: given a frag 
mented drawing he must consolidate the parts 
and visualize an integrated whole. Variations 
in the stimuli alone may make integration 
easier or more difficult depending on the 
number, size and location of gaps, and the 
complexity of the entire drawing; but whether 
and how closure is achieved depends in large 


measure on the individual’s ability. The situa 
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tion in the Concealed Figures, which requires 
S to counteract tendencies toward closure, 
may be viewed similarly. To the extent that 
factors in the field enhance closures that are in 
conflict with a particular closure sought, the 
capacity of the individual to withstand and 
discard “erroneous” closure is revealed. Thus 
both the completion of and the resistance to 
closure are aspects of cognitive functioning 
which depend on S’s general integrative abil 
ity (16 

When dealing with Gestalt Completion, not 
only are the aged shown to be generally less 
adequate in developing closure, but, more 
specifically, their responses suggest a loose 
and/or overly rigid cognitive structure. Per 
cepts are relatively vague and poorly inte 
grated, bearing little resemblance to the figure 
delineated. They are also focused on details 
to the exclusion of other aspects ol the draw 
ing, perseverative and concretized in the sense 
that each segment given as a separate is treated 
as a separate and the relationships between 
adjoining segments are not envisaged 

Evidences of similar cognitive process in the 
aged were encountered by us on a task involv 
ing the judgment of ambiguous stimuli (14) 
Che Ss were instructed to respond either ‘‘cat”’ 
or “‘dog’”’ to each of a series of 13 line drawings 
graded through successive small modifications 
from a clear cat at one extreme to a clear dog 
at the other. Older persons, in the face of the 
progressively changing information provided 
within this order of cards, tended to exhibit a 
later point of shift in judgment as well as con 
siderably more vacillation in sequence of 
judgments than younger individuals 

The deficit of aged persons in organizing 
ability is described perhaps even more strik 
ingly in Rorschach investigations of senes 
cence. Because of the greater ambiguity and 
lack of structure in the stimuli and the in 
creased demand that organization be provided 
by S it is not surprising that repeated studies 
have found a distinct constriction and stereo 
typy of thought, confusion and cautiousness 
(4, 5, 12, 18) 

Unlike the synthetic ability needed on Ges 
talt Completion, for Concealed Figures S must 
adopt an analytic attitude in order to discover 
whether the simple figure is embedded in the 
complex design. Each complex design contains 


many lesser figures which overlap and are 
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Identical 
lines form one or 


interwover! in intricate patterns 


lines and combinations of 
more sides of a variety of different possible 
figures having the attribute of good form. The 
itself is a harmonious 


entire contguration 


whole. Thus the task basically taps the ability 
to maintain and recognize a given closure de 
spite the intrinsic distractions of a more stable 
larger field and equally good competing ges 
talten. It is in this particular sense that the 
aged are found wanting in flexibility of closure. 

A special problem arising is that of explain 
ing the matter of omissions. The presence of 
omissions on Gestalt Completion is probably 
attributable to the instructions which, it will 
be recalled, permitted the option of leaving the 
But our 


two groups exercised their option in different 


item blank if it proved too difficult 


measure. The old produced a significantly 
higher frequency of failures to respond than 
the young. We would hypothesize that this 
finding reflects one or both of the following 
characteristics in the aged: 1. greater impo- 
tence in the closure function, a phenomenon 
already witnessed in the analyses of responses 


actually given; and 2. a more cautious attitude, 
i.e., the need for a high degree of certainty 
This tendency to 


before committing oneself 


inhibit response may represent a generalized 
adaptive mechanisra through which the aged 
person avoids recognizing his inadequacy. 
Similar behavior in this same group was evi 
denced on a verbal learning experiment in 
cluded in our battery (15). In paired-associate 
learning the older individuals, both initially 
repeated trials, much more usually 


the stimulus item 


and of 


gave ‘‘no response” when 
appeared 


However 


were no omis 


unlike Gestalt Completion, there 
ions on Concealed Figures. The 
reason is apparent in the set provided S by 
He searches the large design 
the 


ivgnment 


rmine whether it incorporates 


figure. If he discovers that it does, he 


the item; if he cannot perceive that it 


then he assumes the figure does not 


exist within the large design and marks the 


weordingly Under these conditions a 


failure to respond is therefore not likely 


SUMMARY 


\ young and old group of 16 Ss each, mean 


ages 26.8 and 78.1 yearsand equated for intel 


7 


ligence, were compared in performance on two 
tests of perceptual closure, Gestalt Comple- 
tion and the Thurstone adaptation of Gott- 
schaldt’s Concealed Figures. The former pro- 
cedure requires the capacity to organize, the 
latter to resist closure. Older Ss were distinctly 
inferior on both tasks. Moreover, an analysis 
of “wrong” responses on Gestalt Completion 
revealed signs of perseveration, fixation on 
details and concretization by the aged individu- 
als, as well as a more general tendency toward 
vagueness and poor articulation of form. The 
old also omitted a significantly greater number 
of items on Gestalt Completion. This finding 
was interpreted as indicating one or both of 
the following: further evidence of a lessened 
integrative ability, and excessive cautiousness, 
i.e., a defensive reluctance to venture response 
for fear of recognizing their inadequacy. The 
over-all results were viewed as reflecting the 
overly loose or overly rigid cognitive function- 
ing of the aged. 
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DANUTA EHRLICH, ISAIAH GI 
l'niver ily 
DER what condition will persons 


voluntarily expose themselves to in 


formation? This is an important 


theoret ind practical question tor those 
interested in social influence processe It is 
widely iccepted inat the audiences of ma 
media are to a large extent self-selected. How 
ever, there has been little ngorous specification 
or tematic research concerning the variables 
related to seeking out or avoidance of new 
information. Festinger has recently proposed a 
theory (2) which make ome definite predic 
tions about selective exposure to inlormation 
following decisions, and it is the purpose of this 
tudy to test several of them. Before the pe 
cific predictions from the theory are stated the 
relevant parts of Festinger’s theory will be 
ummarized brietly 

The theory is concerned with relations 
amony cognitive elements, that is, things which 


a person knows about himself or his environ 
ment. For example, knowing that one is on a 
picni a cognition about one’s behavior; 
knowing that it IS raining, a cognition about 
the environment. Cognitive elements may be 
either consonant, dissonant, or irrelevant to 


one another. A behavioral element is consonant 


with an environmental element if, considering 
these elements in isolation, the behavioral 
would follow from the environmental element; 
dissonant if, considering them alone, it would 
nol follow. Thus knowing that it is raining is 
ordinar dissonant with going on a picnic, 
consonant with staying home 

According to the theory, the presence of dis 
onance wpive rise to pre ure to reduce or 
eliminate it. The strength ol these pre sures 1s 
1 function of the magnitude of the dissonance 
The theory also state that the greater the 
importance or the value of the elements in 

I} esearcl i i t ( the cure ) 
of Dr. La Ie er while the aut were me ri 
of | ite ir at the | ersity of M esota 
Ihe to thank |} for | gen iS adivice 
ul t ( ‘ t I} t va l i 
of the research progra the La at f Research 
in Social Relatior 


volved, the greater is the magnitude of the dis- 
sonance and hence the greater the pressure to 
reduce it 

The total magnitude of the dissonance be- 
tween a behavioral element and the cluster of 
relevant environmental elements, assuming 
they are all equally important, is equal to the 
proportion of dissonant elements in this cluster. 
rherefore, one way that a person can reduce 
environmental ele- 


dissonance is to add new 


ments that consonant with his behavior. 


Thus the theory 
dissonance in general seek out consonant infor- 


are 
predicts that persons with 


mation and avoid information that would in- 
troduce new dissonant elements 
When a person makes a decision, a corre- 


sponding behavioral element is established and 


his cognitions about the alternatives among 
which he has chosen are then consonant or dis- 
sonant with this element. All the favorable 


aspects of the unchosen alternatives and un- 
favorable aspects of the one chosen are dis- 
the 
creation of dissonance is 
decisions. After the choice has been made, the 


sonant with choice. Consequently, the 


a common result of 


person therefore tends, act ording to the theory, 
to expose himself to information that he per- 
ceives as likely to support the decision and to 
avoid information that is likely to favor the 
unchosen alternatives 
The purchase of a new automobile, for 
example, is usually a rather important decision 
for a person. Considerable dissonance should 
exist for a new car owner immediately after he 
has bought his car; all ‘‘good”’ features of the 
makes he considered, but did not buy, and 


bad”’ features of the one he bought are now 


dissonant with his ownership of the car. He 
should also attempt to reduce this dissonance. 
In this instance of postdecision dissonance, 


sources of information in support of the de- 
cision are readily available. Since automobile 
advertising contains only material favoring the 
particular car advertised, reading advertise- 
ments of his own make is one way a new car 
owner can information his 


get supporting 
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choice and thereby reduce his dissonance. On 
the other 
other cars may increase his dissonance 


hand, reading advertisements of 
rhe specific hypotheses of the study were as 
follows: 

1. After a decision persons tend to seek out 
dissonance-reducing information. Thus new car 
owners will read advertisements of their own 
cars more often than those of (a) cars which 
they considered but did not buy and (8) other 
not involved in the choice. 

2. After a 
dissonance-increasing information 


cars 
tend to avoid 
Thus 


advertisements of 


dec ision persons 


new 


car owners will read con 


sidered cars less often than of other cars 


Postdecision dissonance is, in general, 


reduced over a period of time. Thus these 


selective tendencies in readership of adver 
tising will not exist for owners of old cars 

The theory predicts other consequences of 
postcecision dissonance, for example, changes 
in desirability ratings of chosen and unchosen 


(1) 


Our present_concern is limited to the effect of 


alternatives as investigated by Brehm 


decision-making on exposure to information 


METHOD 


Qne hundred dents ol 
of eight 


interviewed? 


and twenty-five male re 


Minneapoli ind its suburbs. who owned one 


were in an 
appointment had been 


of the 


advertising survey alter an 


made by telephone. Sixty-five respondents were 


ew car ow! chosen randomly from a list of recent 
auto registrations, dated four to six weeks before the 
time of the interview. The other 60 respondents owned 
cars manufactured in 1952 (three years before the 
study was done) or before, and were selected from the 


directory 


Actually 


ywought 


Minneapo telephone about a 
third of the old car 


none of the 


owners | their cars after 
purchases were made during 
number 


the year in which the study was done Phe 


of owners of different makes o ir vas approximately 


equal in both groups 

During the course of ne nterview 
1. Each respondent wa 
ents whicl 


where and 


asked to recall recent auto 


had impressed him par 


ticularly and to say when he had seen the 


advertisement his impression was 


whether or 1 " 
Recal! Data 


yf 
favorable, and how much he had read 


2. Each wa hown issues of popular magazines 
which appeared tour weeks or less before the interview 
after the new car owners had made their decisions) 
and copies of one of the two Minneapolis newspapers 
io even da prior to the interview. Only those 
| catior were included which the respondent pre 
viously indicated as having read regularly For each 


asked if he had seen it, and 


raig Erickson and Peter 


ance in conducting the interview 
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if so, he was shown each car advertisement it contained, 
had noticed the advertisement, and, 
had 


just glanced at it” (Re 


asked whether he 


if he had, whether he “read all,”’ “read some,” or 
ynition Data 
3. Each was presented with eight large plain envel 


The 


“new 


opes bearing the names of the eight popular makes 


respondent was told that his reactions to some 


automobile advertising’ were desired and asked to 


choose two of the envelopes in order to read and com 


ment upon the material they contained Envelope 
Data 

At the conclusion of the interview the respondent 
was asked to name other makes which he “seriously 


considered” before he decided to buy his present car 


RESULTS 
As a prerequisite for the predicted selectivity 
in a person’s exposure to information, he must 
the likelihood 


reduce or increase 


have about 
that a 


dissonance for him. 


some expectation 


given item will 


Thus the new car owners 
must first recognize that a particular make is 
they must at least 


being advertised, i.e., 


notice the advertisement. For each respondent 
the 


read 


percentage of advertisements which he 
of those which he noticed, was therefore 
calculated separately for his own car, the cars 
he considered, other cars not involved in the 
choice, and also for those considered and other 
cars combined. (The categories ‘‘read all”’ and 
“read some’”’ were combined because there were 
two few advertisements reported as read com 
pletely to make a separate analysis feasible; 
the for 
categories.) The means of these percentages 


results are essentially same these 
for the two groups and the corresponding N’s 


are reported in Table 1, which also contains 


rABLE 1 


MEAN PERCENTAGES OF ADVERTISEMENTS 
NOTICED AND OF ADVERTISEMENTS 
Reap or THose NoTicep 


(Corresponding N’s appear in parentheses 


Make Advertised 
Car 
Owners Considered 
Ov ( | ( Oth 
wn onsidered her Plus Other 
Percentage Noticed 
New 70 (52 66 (50) 146 (64) 48% (64) 
Old 66 (S51 52 (31 10 (@) 41 (GO) 
Percentage Read of Those Noticed 
New 67 (47 39 (46 44 (64 $5 (6A) 
Old 41 (44 SS (20 27 (57 30 (57 
Note The N are reduced because in some cases t uliver 
eme ff a particular kind appeared in the issues shown or 
none of those w appeared were noticed. They are further re 
juced because t all respondents named cars a seriously ’ 
lered.”’ 
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FABLE 2 


1GN Test COMPARISON: 


(ar Owners 


omparisor 


New Old 
Y% Own Minus % Con + 21 s 
sidered 4 6 
f 01 n.s 

Between Groups p « 05 (7 43.90)* 
Y Own Minus % Other + 31 20 
a 16 

t 01 r 

Between Groups p < 03 (x t.93 


* Corrected for continuity 


means for the percentage of advertisements 
noticed of those which were shown 


Table 1 


owners read advertisements of their own make 


It can be seen from that new car 
more often than advertisements of both con- 
sidered and other cars. These differences do not 
old fact the 


difference between reading of own and consid- 


appear among car owners; in 
ered car advertisements is in the opposite direc- 
tion. However, it is obvious from Table 1 that 
new car owners do not read advertisements of 
considered cars Jess often than of other cars. 
Finally, it is apparent that readership of ad 
vertisements, in general, iS greater among new 
car owners 

Ihe significance of these differences was de 
termined by means of the sign test. For each 
respondent the sign of the comparison between 
the percentage of advertisements read of the 
different 
tribution of signs was tested using the critical 


categories was found and this dis- 
values given in Mosteller and Bush (3). Results 
of these comparisons appear in Table 2 which 
that apparent from 


Table 1 are significant car owners read 


differences 
New 


significantly more advertisements of their own 


shows the 


car than of both considered and other cars; 
none of the other comparisons yield significant 
differences. It can be seen that the N’s have 
been further reduced because these compari 
sons were not possible for all respondents 

The significance of the differences between 
new car and old car owners was tested by 
means of chi square. Results of these tests are 
also given in Table 2. The comparison own vs 
considered is in the predicted direction with 
p « OS 


cant at the .03 level, and the comparison own 


The comparison own vs. other is signifi- 


vs. considered plus other is significant at the .01 


BETWEEN PERCENTAGE OF ADVERTISEMENTS READ OF DIFFERENT 


ATEGORIES 


Car Owners 
Comparison 
New Old 
% Own Minus % Con +- 32 20 
sidered Plus Other 7 17 
p< 01 n.s 
Between Groups p < O1 Gc = 6.89 
Y% Considered Minus % + 19 10 
Other 15 7 
n.s n.s 


Between Groups, n.s 


level; there is no significant difference for the 
comparison considered vs. other. 

One other comparison in the recognition data 
is worth noting.* New car owners were divided 
into two groups; those who named less than 
two cars as “seriously considered” and those 
who named two or more. The difference in 
reading own vs. considered plus other was exam- 
ined separately for these two groups and was 
found to be greater for those who considered 
more than two cars. That is, new car owners 
who named more than one considered car read 
relatively more advertisements of their own 
car. For this group the mean difference of the 
percentages was 37; that of the other group 
was 26, The difference between the two groups 
was tested by a median test and the resulting 
(two-sided) p 19 (12 of the 19 who consid- 
ered two or more were above the median; only 
12 of the 28 who considered less than two were 
above it). 

For the Envelope Data the difference be- 
tween groups is in the predicted direction but 
not significant. Fifty-one of 60 (85%) new car 
owners chose an envelope of their own car, 45 
of 59 (76%) old car owners chose one of their 
own car. Respondents in both groups obvi- 
ously chose to read and comment upon “‘new 
advertising” of their own makes far more 
often than would be expected by chance. 

The number of impressive advertisements 
(Recall Data) which the respondents read is 
too small to make analysis of these data pos- 
sible. (Of course no advertisements were in- 
cluded which appeared before the new car 


+See the section for the theoretical 


relevance of this comparison 


discussion 











POSTDECISION EXPOSURE 


owners made their decisions or before a com- 
parable length of time for old car owners.) 


Disc USSION 


The data from the recognition of advertise- 
ments which appeared in recent publications 
(after the new car owners made their decisions) 
strongly support the prediction of Festinger’s 
theory that, after an important decision, per- 
sons seek out consonant information. New car 
owners were found to read advertisements of 
their own car more often than those of cars 
they considered but did not buy, and of other 
cars not involved in the choice. That this 
selective tenden y in exposure to information 
is not simply a result of car ownership but is 
related to decision making is demonstrated by 
its failure to occur among old car owners. The 
Envelope Data also indicate that new car 
owners prefer to read advertisements of their 
own car although the difference between new 
and old car owners is not significant. 

An alternative explanation of the results 
might be offered. If people do not remember 
accurately when they read an advertisement, 
and if shortly before buying a car, persons are 
more likely to read advertisements of that car 
than of others, then the advertisements of 
their own make may have actually been read 
before the decision and the obtained differences 
may be a result of the new car owners’ inability 
to remember the issue in which they read them. 
rhis interpretation seems unlikely since it is 
plausible to suppose that, before their purchase, 
the new car owners attempted to learn as much 
as possible about all the different makes in 
volved in their choice and therefore read ad 
vertisements of the makes that they considered 
as often as those of the make that they finally 
chose. Nevertheless, the possibility that it 
could account for the data was checked by 
determining whether the advertisements of 
the make actually chosen by the new car 


owners which they said they read also appeared 
during the month prior to their purchases. 
Only four of 54 such advertisements also 
appeared in this period. When these few in- 
stances in which the alternate explanation 
might apply are eliminated from the analysis 


the results are essentially unchanged.‘ 


There are only two smal! changes in the ignifi 
cance tests reported in Table 2. When the ir ads 
are elir eC the x28 lor the compariso etwe 


rO RELEVANT INFORMATION 101 


The prediction that new car owners would 
read fewer advertisements of considered cars 
than other cars was not substantiated. How- 
ever, the data are not sufficient to reject con- 
clusively the hypothesis that dissonance-in- 
creasing information is avoided after a decision. 
It is possible that some of the new car owners 
expected advertisements of considered cars 
to decrease—not increase—their dissonance 
They may have read the advertisements of the 
rejected cars not with the intention of getting 
further information about their advantages 
but for the purpose of finding fault with them, 
comparing them unfavorably with the chosen 
car. For example, an owner of a make with 
high horsepower who regards horsepower as an 
important feature of automobiles may read 
advertisements of considered cars of lesser 
horsepower to remind himself of their inferi 
ority in this respect. We would not expect this 
to occur frequently but it might happen often 
enough to account for the failure of the predic 
tion. The data do not provide any test of this 
interpretation 

The indication that new car owners who 
named more than one considered car tend to 
read relatively more advertisements of their 
own car than do those who named one or none 
is additional, suggestive evidence for Fes 
tinger’s theory. Since the total magnitude of 
dissonance is a function of the proportion of 
dissonant elements, we would expect that these 
persons would in general have greater dis- 
sonance, for as the number of unchosen alter- 
natives increases the proportion of dissonant 
elements increases. Therefore the theory pre- 
dicts that they should show a greater tendency 


to seek out consonant information. 


SUMMARY 


Readership of auto advertising by new and 
old car owners was investigated in order to test 
some predictions of Festinger’s theory of dis- 
sonance concerning selective exposure to infor- 
mation following decisions. It was found that 
new car owners read advertisements of their 
own car more often than of cars they consid- 
ered but did not buy and other cars not in- 
volved in the choice. These selective tenden 


groups for own minus considered and for own minus 
considered plus other are reduced to 3.61 and 5.90 


respectivel 
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cies in readership were much less pronounced 
among old car owners. This finding supports 
the theoretical derivation that persons in gen 
eral seek out Consonant or supporting inlorma 
tion alter an important decision in an attempt 
to reduce dissonance resulting from it 

From the derivation that persons tend to 


avoid } 


onance-increasing inlormation it was 


ar owners would read 


pred ted that new 
advertisements of considered cars less often 
than of other cars. The data do not confirm 
this prediction. It was suggested that some of 


the new car owners may have expected the 


P 


SCHONBACH, AND J 
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considered car advertisements to decrease 


not increase—their dissonance. 
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A NEW LOOK AT VIGILANCE AND DEFENSE! 


DONALD 


Teachers ( llege, 


s New Look experiments continue to 
accumulate, the exact nature ol per- 
ceptual vigilance and defense remains 

open to question. Various investigators (1, 2, 7) 


have shown that anxiety-provoking stimuli are 
recognized at both higher and lower thresholds 


than control stimuli, but the meaning of these 


threshold deviations is still unclear and their 


not established. Findings are am 


reliability 


biguous partly because (a) the affective quali 


ties of the so-called “taboo”? and control 
stimuli are usually assumed, not predeter- 
mined (4, 5, 8); (6) the frequency counts 


generally used often do not apply specifically 
to the experimental sample; and (c) thresholds 
are usually measured only once, after the 
experimental threat has been introduced (5, 8 

heck on whether the stimuli 
matched, ol 
the effect of residual structural 


il 


There IS thus ho 


were truly and thus no means 


parlialling out 
difference 
Apart from these procedural shortcomings, 


in “taboo’’ and control words 

the question still remains as to what processes 
contribute to vigilant and defensive responses 
studies (1, 


that threshold scores are correlated with per 


Some have succeeded in showing 


sonality factors, but the basis for the correla 
tions has not yet been adequately explained 
Vigilance and defense have not been explic itly 
distinguished in terms of the perceptual opera 
tions involved, and the possibility exists that 
they are two manifestations of the same under 


lying variable 


As a start toward a single-factor theory, it 

might be argued that both vigilance and de 
It ver is based « i dissertation submitted in 

candida for the degree of Doctor of Philosophy at 
Teachers College, Columbia University. The writer is 
indebted to members of his thesis committee, Helen 
M. Walker, Paul E. Eiserer, and especially E. J. Shoben, 
Ir.. Chairman, for their suggestions and encourage 
ment; to Gardner Murpl vho made this study pos 

e by extending his advice, criticism, and the research 
facilities of the Merninger Foundation; and to David 
.. Herman and Joseph E. Brewer for making possible 
nitia!l pilot work at the University of Wichita. This 
study was done while the writer was a research assistant 
at the Menninger Foundatior 

* Now with the Research Center for Mental Health, 
New York Universit 
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fense result from the disrupting effect of the 
threatening stimuli. Anxiety associated with 
the word, for example, interferes in 


hierarchy of 


“taboo” 


some way with the S’s normal 


responses to a partially exposed stimulus. This 
disruption has the effect of reshuffling the usual 
order of prerecognition hypotheses As a result, 
the correct response is given much earlier (low 
threshold) or much later (high threshold) than 
it would occur for the matched control word 
By this reasoning, anxiety is related to both 
threshold elevation and depression, and could 
account for both vigilant and defensive reac 


tions 
These assumptions suggest that words 
known to create anxiety are recognized at 


either higher or lower thresholds than control 
words. The anxiety associated with the threat 
ening stimuli is related to the absolute amount 
of threshold deviation, but not to either Vigi 
When the associ 


ated anxiety is removed, threshold deviations 


lance or defense separately 


are expected to diminish 

Ihe present experiment was designed to test 
these assumptions while taking into account 
the major sources of error mentioned above 
control words were carefully 


Threat and 


equated with respect to length, frequency, and 
ease of recognition before experimental failure 
rhe experimental anxiety was 


the 


was introduced 


with threat 


systematically associated 
words, and its effect independently measured 
rhe following hypotheses were tested 

I. Threatening stimuli are 
recognized at thresholds both higher and lower 


The 


extent of deviation is related to the degree of 


tachistos¢ opi 
than thresholds for matched control words 


anxiety induced by the threat 

II 
then threshold deviations diminish 
ol the 


anxiety 


When the associated anxiety is removed, 
Che degree 


reduction related to decrease in 


is 


METHOD 


['wenty-two adults, five men and seventeen women 


individual two and 
Fach 
equal | 


A] 


tested ir 
one-half he 
three 
ter 


were sessions 


averaging 


urs aplece session was divided into 


two 
their 


approximate art eparated by 


i-minute intermissions were paid for 





NALD P 


rABLI 
rimuLtus W 


were 


inger F« 


watt 

a ches on 

Word were 
and 
until 


d photo 


exposure 
recognition ty] iin ca als- al 
graphed é nd Ww nes ! was 


fitte 


open 
( overing 
inimize ear! 
vutlined 
asured 
is 
rocedure 
tional words of 
individually 
it wa cor 
considered 

e { 


After al 


according 


recognized and paired 


thresholds each control word was 


presented to S on a white card for 30 seconds He was 


instructed to examine the word and mentally rearrange 
DIMLY:MIDLY). He 


its letters (e.g was not directed 


SPENCI 


to form a real word because (a) not control words 


could be meaningfully rearranged; and (6) earlier piiot 
word was 


was a practice 


work indicated that failure to rearrange a 
told that it 


S was instructed to rearrange each 


unsettling, even ll O were 


session. Therefore 
word in whatever way he preferred, as follows 
isualizing words. There 
} 


It is merely a warm 


rhis is an exercise in 
are no right or wrong answers 


up session to get you accustomed to thinking of 


words in different arrangements. The idea is to 


look at a word and to see in what ways you can 


rearrange the letters. Obviously, there are a large 


number of possil 


of. but I don’t expect 


je combinations 


might think 
you to exhaust them all. I 


mere want yu to imagine several arrangements 
for each word. For example, take the word 
DIMLY. You might think of it with the D and 
the M interchanged t ¥ MIDLY 
arrangements wi others will not; 
the end Dake 


I give ‘ mu pr ce rearranging the 


Some 
Mak ¢ 


resuit each word 


nee Car h new 


ination ¢ ou find it; make a mental note 


of the solution and try another. You will have 30 


com! 


for each word 


The DIMLY:MIDLY 


presented to torm a set 


seconds 
example wa tentionally 


which 


his precaution was 


would tavor nonsense 


solutions introduced to dis 
courage +8 
from the 


to differences in their later recognition 


gements 
lead 


No S mentioned 


rom meaningiu arral 


MAINZ 
potentially soluble wort which might 


a meaningful solution and the chances are they would 


because the instructions 


have if they had found one 


by implication, suggest there are few. Moreover, no 


so it may be 


assumed that the control task was, as intended 


S showed signs of boredom or trustration 
, essen 


7 ‘ 


lany neu 

Another, more difficult, practice 
Additional 
were given the 5 with instructions 
f I SPINE: PINES 


word from each (« g£., 
ared quite confident of his ability to 


ra 
task was presented 


following the contro! series words. chosen 
for their solubility 
to mak¢ 
After the | 
form anagrams, he was given the failure words with 
some soluble 
task. He was given these 


All right our 


was excellent. Now I am going to give you 


words added to increase the realism of the 
instruction 

performance on the practice 
performance on the 


list you just finished, you should be 


ist. Judging | 
able 
one new word from each of the words on 
the test list. Tell me 
You will have 
As the S failed each of the failure words, the E would 


Cat h word as 


30 seconds for eacl 
sharpen this feeling by asking “Why aren’t you doing 
better? You did we 


Similar 


on the practice tash and other 
Most Ss became 


and occasionally 


questions increasingly dis 


turbed by the task oflered nonsense 


solutions when they could think of nothing else 
For the majority of Ss the failure task achieved its 
ilts persevered 


In all 


other respects the failure and control tasks were essen 


intended, depressing effect and its re 


for the remainder of the experiment (see below 


tially similar. In each case S was presented with a 


series of words, each of which he attempted to rearrange 


within a 30-second period. Words were matched for 
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length, fre quency, and recognition threshold. The S 
was not asked to verbalize his solutions on the control 


He tended 


because 


task; he could not do so on the failure task 
to form nonsense anagrams on the control task 
of the forced to give similar 


responses to the failure task 


instructions) and was 
because no other solution 
was possible). Failure and control experiences were 
essentially different only with respect to failure, and 
it seems reasonable to explain subsequent experimental 
differences in terms of this variable 

Immediately after the failure, each S was presented 
with an Adjective Check List (ACL) of 145 


adopted from Nowlis ef al. (6). included in the list 


words 


were two groups of words identified by Nowlis on an 


Anxiety cluster and an Elation 


The S was asked to check the 


empirical basis: an 
cluster of 10 words each 
words which best described his mood. He was then 
and after a short inter 


think of 


beginning with Q’’), he was asked to list all words he 


questioned about his feelings, 


polated task List all the words you can 


could remember from the failure and control lists 
After a ten-minute rest period during which the 
experimenter left the room, Ss’ recognition thresholds 
for the fa 
Iria! I] 
each to a criterion of one 
was then informed that al 
were actually impossible and that he had been forced 
to fail because the experimenter was interested in the 


ure and control lists were measured a second 
Words were exposed in random order, 
The § 
the anagrams he had failed 


time 
correct recognition 


perceptual consequences of failure. He was encouraged 
to speak about the When 
the S had expressed his feelings he was given the ACL 


experience at some length 


time 
for failure and control words was again meas 
then 
after another ten-minute rest 


a second and after another interpolated task, 
memory 
thresholds measured a 


ured. Recognition were 


third time (Trial III 


RESULTS AND DISCUSSION 


3efore either hypothesis can be tested, it is 
necessary to establish that soluble and insol- 
uble control words can be combined for pur- 
poses of analysis. Anecdotal reasons for as- 


suming they could be treated together were 


presented earlier and as a further check, 
thresholds for soluble and insoluble words were 
compared on the assumption that if there were 
differences in the anagram experience they 
would appear in the recognition scores. Average 
thresholds after failure for soluble and insoluble 
the 


control words were compared for each S; 
difference for the significant 
(t 1.46; p > .10) 


not seem to 


group 1s not 

Meaningful solutions to 
words did inhibit later 
recognition. Moreover, average failure thresh- 
olds were not significantly different from either 


words 


control 


average thresholds for soluble control 
(f O38: p > 20 
insoluble control words (t 

Accordingly, for each 5S, all 
words were combined and his average 


or average thresholds for 
1.58; p > .10) 
thresholds for 


contro! 
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control threshold subtracted from his 
average threshold for failure words. A second 
difference score was computed from thresholds 


was 


for failure and control words after the explana- 
tion 
words were recognized, on the average, at 
higher levels than control words; a negative 
score indicates the reverse relationship 
Anxiety created by the failure was estimated 
by countiig the number of words checked on 
the Anxiety cluster of the ACL after failure 
and dividing by the total number of adjectives 
checked (Anxiety Ratio I). Anxiety was simi- 
larly estimated from the second ACL after the 
explanation (Anxiety Ratio II). Elation cre- 
ated by the explanation was estimated by 
counting the number of words checked on the 
Elation cluster after the explanation and divid 
ing by the total number of words checked. 
Memory was scored by subtracting the num- 


A positive score indicates that failure 


ber of control words from the number of failure 
words recalled after the failure; a similar score 
was obtained for recall after the explanation. 

In accordance with the first hypothesis, anxi- 
ety expressed after the failure (Anxiety Ratio 
I) was significantly correlated with absolute 
differences between thresholds for failure and 
control words after the failure (rho 57, p 
Table 2). Anxiety was uncorrelated 
with the tendency to recognize failure words 


< .01; see 


rABLE 2 


RANK-ORDER CORRELATIONS BETWEEN 
Major VARIABLES 


Rho 
Anxiety Ratio after failure vs 
Threshold difference after failure (ab 
solute value 
Threshold difference after failure (with 
sign) 
Threshold difference after explanation 
absolute value 
Failure word consistency 
Control word consistency 
Memory after failure 
Memory after failure vs 
Threshold difference after failure (ab 
solute value) 
Threshold difference after failure (with 
sigr 
Elation Ratio after explanation vs 
Threshold difference after expianation 
(absolute value) 
Anxiety change vs 
Threshold change 
p< OS 
en < 01 
For 22 


a rho of 508 is significant at the 1% level 


cases, a rho of .359 is significant at the 5% level, and 
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above contro words (rho Q2 hence, 


perceptual defense” seem unre 
Ss with high anxiety 


at both higher 


anxiety and 
lated 


scores recognized control words 


(On the contrary, 


and lower thresholds than failure words, with 


the absolute de gree ol difference a function of 
’ 3 indicates how absolute 


their anxiety.’ Table 


threshold differences increase as a function of 


anxiety. Since the failure and control words 
were equated with respect to frequency, length, 
and ease of recognition, the differences in per 
ception after failure cannot be attributed to 
these Moreover, 


of verbal report cannot account for the findings 


residual factor suppression 


as suggested by Whittaker, Gilchrist, and 
Fischer (11) because the significant correlation 
is with recognition dey lation, not with recog 


nition delay 
When threshold 


nition of failure and control words seems to be 


ilone are « onsidered, recog 


randomly determined. Thirteen Ss recognized 
failure words at average thresholds below 
control words Vigilant ‘ eight Ss showed 
the reverse, or “‘defense’”’ reaction; and one § 


sts. For all SS, 


sholds are not sig 


did not discriminate between |i 


average failure-word thre 
different from average control word 
(10 


vigilant” and 
findings of Post 


nificantly 
test 


hold » > 3; two-tailed sign 


i 
The pre ent 


thre 
dic hotomy ol 


defensive paralle the 


man and Solomon (7 Chodorkoff (2 ind 
other ind upport the general notion that 
unpleasant stimuli are not necessarily per 


elevated thresholds. Such data are 
usually explained by assuming that some Ss 
rhe 


into 


ind other ‘defensive.”’ 
present findings call this interpretation 


question because of the mixed nature of indi 


vidual responses. No S was exclusively “‘vigi 
lant’ or “defensive” in his recognition of all 
wore rather, each S recognized some failure 
wort ibove, and some below, matched con 
trol words.‘ The random distribution of differ 

| nt irguc i that more xio0u ) hbecause 
the “ne ensitive to the rences between 
failur ind ontrol wore vould iiso «differentiate 
hetweer r oluble « t vord Thev do 
' \nxiet after fa ‘ not gnificantly correlated 
wit! the i te erence betwee! thresholds for 
r a l ible contr word 

The tendet toward recog! g all failure words 
above or below matched control wor orrelated Ol 
with Anxiety Ratio I. Thus, whether ar exhibited a 
preponderance ol \ gilar t”’ of detet ve” re ponses 


not related to anxiety 
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SPENCE 
rABLE 3 
MEAN FAILURE AND EXPLANATION ABSOLUTE 
PHRESHOLD DIFFERENCE SCORES 
Absolute 
I Threshold 
Geena? Difference Difference 
aw re After core After 
Failure** Explanation*** 
Mean Mean)j 
High Anxiety 3.39 3.12 
Medium Anxiety 1.68 1.16 
Low Anxiety 1.20 73 
* Group placement determined by score on Anxiety Ratio I 
** Between groups, x? 6.6, p 
*°* Between groups, x* 1.0, p 
Note ignificance tests based on an analysis of variance 


for ranks described in Walker (1 


ences for ali Ss makes it difficult to argue that 
““‘defensive”’ 
of a particular personality 


a “vigilant” or average response 
is characteristic 
style. The tendency to recognize failure words 
above or below control words appears to be 
dependent on factors other than the particular 
characteristics of the S$ 
Absolute threshold 
pears to be 
Ihe tendency to remember more control 


size of differences ip 


also important in relation to 
rec all 
than failure words after the failure correlated 
50 with the absolute difference 
(p < .05). When of difference was 
taken into account, the correlation dropped to 


10. Thus an S who forgot relatively more 


threshold 


direction 


failure words did not necessarily 


failure words at generally higher levels than 


perc eive 


control words. Forgetting of failure words was, 


however, directly related to anxiety. The Ss 
with high anxiety scores tended to recall rela- 
tively more control words after the failure 
(rho 49: p< 05 These results are strongly 


suggestive of findings by Glixman (3) who 
tested memory for failure under three degrees 
of stress. He found that the number of incom 
pleted (failure) tasks recalled varied inversely 
ituation; 1.e., 


the severity of the stress 


which the 


with 


with the extent to instructions 
stressed the personal implications of the task. 
the 


stressful) 


results bear out his findings to 
that 
words is a function of anxiety 

Did threshold differences disappear when 


as predicted by Hy 


Present 


degree forgetting of tailure 


anxiety was reduced, 
pothesis II? Sixteen Ss had smaller anxiety 
scores on the ACL after the explanation 
(Anxiety Ratio II) than on the ACL after the 


failure (Anxiety Ratio I), but only tenof these 
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had smaller average threshold differences after 
the explanation. Change in anxiety scores cor 
related .11 with change in threshold differences 
The elation scores correlated only 29 with 
threshold differences after the explanation. It 
1S possible that the explanation did not produc e 
relief of a sufficient degree to test the hypothe 
sis properly, in which case anxiety created by 
the failure should still be evident after the 
explanation. A significant correlation (rho 
52; p < .01) between failure-induced anxiety 
(Anxiety Ratio I) and threshold differences 
after the explanation suggests that such is the 
case. Enough anxiety was left unaffected by 
the explanation to influence thresholds in 
Trial III (see Table 3 The presence of 
lingering anxiety is understandable in view of 
the impact of the failure task and the brief 
time allotted to explanation 

The high correlation with initial anxiely 
suggests that because of the cumulative effects 
of the initial failure, the experiment did not 
provide a Satislactory test ol Hypothesis ig 
Threshold and memory scores alter the ex 
planation are probably some function of both 
the explanation and the earlier failure, and 
because of the difficulty in their interpretation, 
the remainder of the paper will be devoted to 
an analysis of threshold changes after failure 
Che high correlation, furthermore, is surprising 
since one would ¢ xpect scores on repeated trials 
to approach some asymptotic limit and argue, 
in consequence, that since thresholds for con 
trol and failure words were originally equated 
in Trial I, they should again approach equality 
alter repeated testing That they did not is 
further evidence of the strength of the experi 
mental failure 
If threshold differences after the failure are 
ated to anxiety, it can be argued that these 
differences reflect the impact of anxiety upon 
the perception ol the failure words. Accord 
ingly, thresholds for failure words should 
change more than thresholds for control word 
from Trial I to Trial IJ, with change a direct 
function of anxiety. Further analysis does not 
support this assumption. Anxiety is not related 
to change n failure word thresholds, but 
rather to lack of change, or consistency ol 
recognition. Contrary to expectations, oS with 
high scores on Anxiety Ratio I recognize failure 
words after failure at thresholds which corre 


ate } ighly with thresholds before the tailure 


The higher the anxiety, furthermore, the higher 
the consistency, and the rank-order correlation 
between intertrial consistency of failure word 
thresholds and Anxiety Ratio | is .38 (p 
.O5). Comparable intertrial consistency of con 
trol word thresholds, before and after the 
failure, is not related to Anxiety Ratio | 
(rho 20). Thus it would appear that 
anxiety is inversely related to failure-word 
variability and unrelated to control-word vari 
ability, suggesting that control-word threshold 
can be interpreted as a recognition baseline for 
all Ss, unconfounded by the effects of anxiety.® 
Apparently, then, the anxious S after the 
failure did not substantially alter his choice o1 


sequence of hypotheses in recognition of the 


failure words. Anxiety stabilized rather than 
disrupted perception. Although urprising in 
view of the assumption mentioned earlier that 
anxiety is disruptive, this finding is in line with 
recent studies of anxiety and habit strength 
Taylor and Spence (9 have suggested that 
anxiety may reinforce the habit strength of 
both correct and incorrect response Applied 
to the present study, their findings he Ip explain 
why Ss who are anxious following the failure 
hould respond to the failure words essentially 
as they did initially. Apparently the failure 
reinforced both correct and incorrect guesses 
for failure words with the result that recogni 
tion threshold alter the failure correlated 
highly with thresholds before failure. The 
Taylor and Spence findings might also ex 
plain the low intertrial correlation of failure 
word thresholds for unanxiou They 


presumably responded more flexibly to the 


failure; some, but not all of their initial re 
sponses were reinforced consequently, their 
thresholds after the failure were uncorrelated 
with earlier threshold 

This line of reasoning needs to be checked 


by future investigation, but one thing seems 


clear. The anxiety associated with threatening 





word ipparently lead to in ink ibition ol 
Intertria ‘ ter # control word threshold 
fror before to after the Lilure ot gnihicant 
different fror ntertrial cor tem ot tailure word 
thresholds over the same period ({ > 2, two-tailed 
sign test [10 which argues against the OSS! t that 
ym of control words alter the tllure wa 
i miative inkage formed ring the 
contr ta Fac ! ht have formed more jution 
t ( tr word tha to fa re Wor ‘ we the 
inagrar tas but tl ere al R r did 
e ans ' fect , ent recognit 
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response; this inhibition may result in either 
What are usually 
reactions to 


” 


“vigilance” or “defense. 


assumed to be two distinctive 


tachistoscopic stimuli seem, on closer exami- 
two varieties of the same under 
The further fact that no S was 


“defensive” in his 


nation, to be 
lying proces: 
exclusively “‘vigilant’”’ or 
reaction to the failure words makes it difficult 
to argue that these ideal response types can 
be used to discriminate between individuals 
If there are 
reacting to anxiety-provoking stimuli, they 


distinctive individual ways of 


seem partly related to the amount of anxiety 


aroused in the S. Further experiments are 


needed using other measures of anxiety to 


establish these findings more securely 


SUMMARY 
[wo lists of words matched for frequency, 
length, and recognition threshold were pre 


sented in an anagram-solving exercise. After 
one list had been failed, all words were ex 
posed tachistoscopically for a second time. 
aroused by the failure correlated 
the absolute difference be 
failure and 


Anxiety 
significantly with 
recognition thresholds for 


Anxiety did not correlate with 


tween 
control words 
either vigilant” or ‘defensive’ responses 
alone, nor were failure words recognized at 
significantly different levels from control 
words when the anxiety score was not con 
sidered. The relationship between anxiety and 
threshold difference persisted even 


nature of the failure had 


absolute 
after the artificial 
been explained, contrary to the hypothesis 
that explanation would erase the effects of 
failure 

Sequential analysis of thresholds indicated 
that 
trol words did not follow a simple pattern 
appeared to stabilize the repeated 


uccessive recognition of failure and con- 


Anxiet) 


recognition of failure words, with threshold 
consistency a function of anxiety. It can be 
argued, after Taylor and Spence (9), that 
anxiety tends to reinforce the use of prerecog- 
nition hypotheses for threatening words while 
not affecting hypotheses for control words. 
This differential stabilization of failure-word 
hypotheses seems to underlie beth “‘vigilance”’ 
and “defense,” suggesting that these response 
types are less distinctive than is usually as- 
sumed. 
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PRACTICE EFFECTS IN REACTION-TIME TASKS IN 
BRAIN-INJURED PATIENTS ' 


ARTHUR &%. BENTON anp HAROLD L. BLACKBURN 


State University of lowa 


EACTION-TIME studies of nonpsychoti 
patients with cerebral injury or dis 
ease have disclosed _ significantly 

slower times in both simple and choice rea 

tions for such Ss when ompared with appro- 
priate control Ss. However, a considerable 
degree of overlap in the distributions of scores 
is typically found, the RTs of at least half of 
the brain-injured Ss being within normal limits 
1). The question arises as to whether the pres 
ence of cerebral dysfunction in these Ss may 
not reflect itself in a diminished work-capacity, 
i.e., In a failure to maintain an initial level of 


performance in respect to speed and stability 


of response with continued activity. That such 
is the case has been asserted by Goldstein (2) 
on the basis of his observations on brain 
injured soldiers during and after World War 
II. He reported that some patients showed 
initial simple and choice RTs that were well 
within normal limits. After the first ten trials, 
however, these patients were likely to show 
both a lengthening in RT and an increase in 
variability, whereas normal Ss did not show 
these changes within the limits investigated 
(30 trials). Thus, a relatively rapid decline in 
efficiency of performance, as reflected by 
slower and more variable RTs, appeared to be 
characteristic of many brain-injured patients 
whose initial RTs were unremarkable 

The purpose of the present study was to in 
vestigate this possibility of differential practice 
effects in brain-injured and control Ss some 
what more systematically than has been done 
heretofore. Matched groups of patients were 
run on both simple and choice RT tasks and 
the findings analyzed to determine what 
changes with respect to mean score and intra- 
individual variability occurred in the course 


of a series of 30 trials on each task 

This investigation was supported by a research 
grant B-616) from the Nationa! Institute of Neuro 
logical Diseases and Blindness, of the National! Inst 
tutes of Health, Public Health Service. The writers are 
indebted to Dr. Robert J. Joynt for valuable sugges 


tions and criticisms 
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METHOD 


Subjects. The brain-injured group consisted of 38 
patients (28 men, 10 women) on the neurological and 
neurosurgical services of the University Hospitals, 
Iowa City, who carried a final and unequivocal diag 
nosis of supratentorial cerebral disease. A variety of 


diagnoses was represented in the group, vascular dis 


ease and neoplasm being the most frequent (55 per 


cent Che mean age of the group was 36.9 years (SD, 
12.0). Mean educational! level was 10.0 years (SD. 2.6 

Che control group consisted of 64 patients (52 men 
12 women) on the neurological, neurosurgical, and 
medical services who showed no evidence or history of 
cerebral disease, head trauma, or epilepsy \ variety 
of diagnoses was represented, spinal cord disease being 


the most frequent (42 per cent). Nine cases were classi 


fied as psychoneurotic with somatic complaints. The 
mean age of the group was 39.2 year SD, 11.5 
Mean educational level was 9.8 year oon 

No patient who was acutely ill, wh behavior 
raised the question of psychosis, who had a history of 
hospitalization for a psychiatric disorder, or whose 


history Suggested mental dehaency dating back to 
childhood was included in the investigatlior 


As estimated by / and F tests, there were no signifi 
cant differences between the two grou with respect 
to either the means or variances in ige or educational 
level 


Procedure. The rgaction-time apparatus provided for 


the presentation ol a lght 


| timulus which was preceded 


by a buzz of two seconds duration, the latter stimulus 
serving as a “read gna Iwo light vhich were 
mounted seven inches apart on the right and left sides 
of an upright panel which faced the patient, were con 
nected respective to microswitche ittached to the 
right and left sides of the base of the ipparatu 

In the simple reaction-time task, the patient, having 


seated himsell comlortably at the table, was instructed 
to rest the index finger of | right hand lightly on the 
right microswitch and to press the switch as soon as 


the right light appeared The function of the bi 





read ignal was explained to him. Five | 
triais were given to acquaint the patient with the task 
rhirty test trials then followed 

In the choice reaction-time task. the patient was in 
structed to rest the index fingers of hi right and left 
hands on the right and left microswitches and to press 
the corresponding switch when the right or left light 
appeared. As in the simple reaction-time ta the light 
stimulus was preceded | a warning signa buzz) of 
two seconds’ duration The patient was warned not to 


press both keys at the same time. Lights on the ex 


perimenter’s side of the apparatus indicated the key 
which was pressed so that double pressing could be 
noted by the experimenter. Five practice trials were 
followed by thirty experimental trials, con isting of 
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fifteer timulations on the right side and hiteen stim 

ator the left le, presented in a predetermined 
random 

I he e reaction-time task was given first and 
vas followed, after a rest period of 2-3 minute the 
che ca t © task The reactio time were 
recorded a atica n .O1 sec i units | in elec 
trica ‘ clock which wa ct ited the “et 
of th ght ul topped tl lepression ot 
the espn g microswitch 

RESULTS 


Simple RT. The means and standard devia 
imple RTs in the two groups of 
for successive blocks of 10 tnial 
lable 1A the 


core are graphically represented in Figure 1A 


tions ol the 


< 


eat h are 


hown in and trends in mean 


noted that while a 
, there 


From inspection, it will be 


difference in over-al RT level is obviou 


does not appear to be an impressive difference 
in the trends over block ot trials, the control 
group howing no change ind the brain 
injured howing a slight, questionably sig 


continued 


RT with 


were confirmed by 


retardation in 
practice. These impression 
variance which 


effect for 


the results of an analysis ol 


howed a highly iwnificant main 

grou / 25.7 with df 1,100; p 001) but 
no significant effects either for trials (/ 0.59, 
with df 2,200) or for the groups X trials inter 
action (/ 1.11 with df 2,200 Thus it must 
be concluded that there is no evidence for sig 


nificant practice effect . the two trends not 


being demonstrably different from each other 


or the combined trend significantly different 


from a horizontal line 
Three brain injured 5 showed 


n R'T of 20 per cent or more from 


s per cent 


an imcrean 


the first to the third block of trials. The mean 
RT's of these Ss for the tirst blo k of trials were 
197, 221, and 273 msec., i.e., two of them 
howed relatively fast initial RTs. Eight con 
trol s 13 per cent) also howed a retardation 


in R'T of 20 per cent or more trom the first to 
the third block of trial 
Choice RT 
tions of the choice RTs in the 
ucce ve blocks of 10 trials 


lable 1B and the trends over trials 


The means and standard devia 
two groups for 
each are shown in 


graphically 


repre ented in Fig. 1B. It will be seen that 
while the control group showed a mean retar 
dation of 3 msec. per trial between the first 

> The designation, nonsignificant wal meant to 
I ite that a result did not attain the OS confidence 


H. L. BLACKBURN 

rABLE 1 
or Brain-INsuRED PATIENTS 
FoR Successive BLocks 
or TRIALS 


REACTION TIME 
AND CONTROI 


Data in milliseconds 


A. Simple RT 


tatistic Brain-injured trials Control trials 
l 11 »| 21 1-10 11-20) 21-30 
Mean 299 | 304 | 306 |208 | 204 | 208 
D 149 141 139 | 41 45 47 
Mean Intra-Indi 59 65 74 | 35 $3 32 

vidual SD 
B. ¢ e RT 

tatisti Brain-injured trials Control trials 
, 1-20 i 1-10} 11-20) 21-30 
Mean 371 382 | 389 1303 | 304 | 306 
SD 97 113 9S 55 43 44 
Mean Intra-Indi 71 73 70 | 55 19 49 


vidual SD 


and third blocks of trials, the brain-injured Ss 


How- 


ever, an analysis of variance indicated that 


showed a mean retardation of 18 msec 


this difference in trend, as estimated by the 
groups X trials interaction, was not significant 
(k 2.03, df 2,200) 
simple RT, 
the over-all performance levels of the groups 


As was the case with 


a highly significant difference in 


was found (F 28.4 with d/ 1,100; p 001). 
The combined trend (trials effect) was not 
significantly different from a horizontal line 
(/ 2.67 with df 2,200). A separate analysis 


of variance of the brain-injured trend alone 
was done to determine if there might be a sug- 
gestion of a effect for 
group. The obtained F was 2.29 (df 2, 74), in- 
dicating that the hypothesis that the trend 
be re- 


significant trials this 


represents a horizontal line could not 
jected 


Five brain-injured Ss (13 per cent 
a retardation in RT of 20 per cent or more 


showed 


from the first to the third block of trials, the 
initial RTs of two of these Ss being relatively 
fast (i.e., under 330 msec None of these Ss 
was in the group who had shown noteworthy 
retardation in RT in the simple reaction task 
One control S (2 per cent) also showed a retar- 
dation in RT of 20 per cent or more from the 
first to the third block of trials 

Levels 


injured Ss 


analysis of performances of brain 


Since clinical observation has em- 
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or TRIAI A. Smepce R1 


ScoRES Over BLocks 
B. Cnoice RT 


susceptibility to fatigue in brain 
injured patients whose initial RTs are within 
normal limits, it seemed indicated to analyze 


further to determine whether such 


Ss might show fatigue effects that were not 


the data 


in the performances of brain-injured 
RTs. To make this 
group of brain-injured Ss 


apparent 
Ss with initially slow 
analysis, the was 
reduced to an N of 36 by random subtraction 


of two cases and then divided into three sub 


groups of equal size (\ 12), defined by mean 
RT on Trials 1-10. This was done separately 
for the imple and choice tasks, with “‘fast,” 
“medium,” and “slow” subgroups in respect to 
mean RT on Trials 1-10 being formed by the 
counting-off method. The mean RTs over 


block 
range of RTs for each subgroup on Trials 1-10 


of trials of these subgroups and the 


for the simple and choice tasks are presented 


in Table 2. l 


spec tion of this table sI ows that, 


[rm 111 


apart from an expected slight “regression to 
the mean” in thé extreme groups, there are no 
important differences in the trends of mean 
scores over blocks of trials, an impression that 
is supported by the results of analyses of vari- 
ance of these trends which failed to show sig 
nificant levels X trials interaction effects, the 
F for the simple and choice RT interaction 
being 1.04 and 1.54 respectively, each with d/ 
4 and 66 

Changes in intra-individual variability. Since 
it has been asserted that brain-injured Ss tend 
to become more variable, as well as slower, 
with continued practice, this aspect of per- 
formance investigated. Intra-individual 
variability was estimated by computing the 
mean SDs of the RTs for each block of 10 trials 


The mean 


was 


in each of the two tasks for each 
SD for each block of trials for the members of 
group was then computed 
will be 


each diagnostic 
These means are shown in Table 1. It 
noted that for both simple and choice RT the 
brain-injured Ss appear to be considerably 
more variable than the control Ss. Moreover, in 
the simple reaction task, the brain-injured Ss 
appear to show a progressive increase in intra 
individual variability while the control Ss show 
no change in this regard. With respect to the 
choice reaction time no important differences 
in the trends of the two diagnostic groups are 
apparent. When these data were subjected to 
analyses of variance, highly significant group 
effects were found for both the simple (F 

13.1 with df 1,100; p 001) and choice tasks 


rABLE 2 

SRAIN-INJURED PATIENTS FOR 
PRIALS, ACCORDING 
INITIAL RI 


Data in milliseconds, range is for first 


REACTION TIME OF 
Successtve BLocKs oF 
ro LEVEL OF 


block of trials only 


imple RT Choice RT 
Levels Subgroup 
Kange Mean Range Mean 

Fast”’ 

Irials 1-10 154-221 19 245-325 297 

Trials 11-20 208 292 

rrials 21-30 17 310 
Medium” 

lrials 1-10 222-S02 256 337-377 355 

lrials 11-20 263 48 

rials 21-30 200 396 
Slow”’ 

rrials 1-10 307-876 466 377-686 477 

Trials 11-20 459 420 

rrials 21-30 10) 475 
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(F 17.4 with df 1,100 p< 061). The trials 
effects (combined trends) were nonsignificant 
for both the simple (F 0.72 with df 2,200 


0.93 with df 2,200 
the 


Significant 


and the choice task (/ 
With respect to 
x tnal 
(/ 3.77 with df 2,200; p < .0S), 
a difference in the trends of the two groups, 
{ 


imple reaction time, 


groups interaction was 


indicating 


e brain-injured Ss tending to become more 


an the controls with increasing 


Howeve r, 


variable tl 


number of trials no difference in 


trends was found for the choice reaction task 
I 0.81 with df 2,200 
Nine brain-injured Ss (24 per cent) and nine 


control (14 per cent) showed an increase of 
100 per cent or more in intra-individual varia 
to the third block of trials 
In the choice RT task, 


and one 


bility from the first 
imple RT task 


two brain-injured Ss 


in the 


per cent 


control .S per cent howed an increase of 


100 per cent or more in this characteristic of 


responsivenes 
Disc ION 

Ihe control Ss show a mean sin ple visual 
RT (207 msec over 30 trials which seems 
lightly slower than that which might be ex 
per ted of healthy Ss but is still clearly within 
normal limits (3). No significant practice ef 
fects, either positive or negative, were shown 
over the three blocks of trials. While the brain 
injured Ss were characterized by a mean 


imple visual RT (303 msec.) which is sig 


nificantly slower than that of the controls, no 


difference with respect to trend of performance 


ver blocks of trials was shown. Their perform 
ance ke that of the control Ss, did not appear 
to ret ignificant practice effects. Moreover, 
this was as true of those brain injured Ss who 
how fairly fast initial RTs (i.e., over the first 


block of 10 trial as it is of those who show 
a retardation in RT from the very beginning 


Thu 


no upport 


, insofar as simple visual RT is concerned, 
is found for the expectation engen 
dered by that 
injured Ss may show an undue susceptibility 


ome clinical reports brain- 
to tatigue 

Che findings for choice RT parallel those ob 
The 
brain-injured Ss were significantly slower than 
the controls did not i statistically 
significant differential practice effect. This held 
for those Ss whose initial RTs were relatively 


served with respect to the simpler task 


show 


but 
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fast as well as for the total group. Here, as in 
the simple RT situation, no substantial evi- 
dence for a greater susceptibility to fatigue on 
the part of brain-injured Ss was demonstrable. 

The belief that brain-injured patients may 
become progressively more irregular in RT 
responsiveness with continued practice is par- 
tially supported by the findings. In the simple 
RT task the brain injured Ss, when compared 
with the controls, showed a significant increase 
in intra-individual variability over blocks of 
trials. However, this increase did not occur on 
the choice reaction task, in which no significant 
difference in trends was demonstrable 

While the analyzed results of the study are 
generally negative, it is still true that most of 
the findings were at least in the direction pre 
dicted by clinical report. In the simple RT task, 
the brain-injured patients showed a mean re- 
from the first to the third 
the 
In the choice RT task 
mean 


tardation of 7 msec 
blo« k of 


showed no retardation 


trials while control patients 
the brain-injured patients showed a 
18 msec as compared 


the control 


retardation of with a 


mean retardation of 3 msec. in 

group. Moreover, in the simple RT task, a 
significant increase in variability with con- 
tinued practice on the part of the brain 


injured patients could be demonstrated. These 
results raise the possibility that modification 
of the task in the direction of making it more 
complex, lengthening the foreperiod beyond 
optimal limits, or increasing the number of 
trials might yield differential 
practice effects. This possibility can be de- 


evidence of 


cided only by further investigation 

However, while the study may provide only 
a limited test of the general hypothesis that 
brain-injured patients tend to show an undue 
susceptibility to fatigue in RT tasks, it does 
constitute a quite adequate test of the validity 
of the contention, derived from clinical impres- 
sion, that fatigue 
effects after the first few trials in simple and 
choice RT tasks. This does not prove to be the 
each 


such patients will how 


case. The findings do indicate that in 
diagnostic group there are a few Ss who show 
a noteworthy retardation in RT or a striking 
increase in intra-individual variability from 
to the third block of trials in either 


It seems likely that 


the first 
the simple or choice task 
clinical observers have been impressed by these 
changes in responsiveness on the part of some 
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brain-injured patients but have failed to con- 
sider the more numerous negative cases in the 
brain-injured group or the occurrence of the 
same type of change in the responsiveness of 
control patients. 


SUMMARY 


A study of performance trends in simple and 
choice reaction time tasks in nonpsychotic, 
brain-injured patients and a matched group of 
control patients indicated that, within the 
limits investigated (30 trials in each task), 
there were no significant practice effects with 
respect to mean reaction time for either group 
in either task. The performances of those brain- 
injured patients whose initial reaction times 
were within normal limits also failed to reflect 
significant practice effects. As compared with 
the control patients, the brain-injured patients 
showed a significant increase in intra-individ- 
ual variability in simple reaction time with 
continued practice. However, this difference 
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in trends was not found in the choice reaction 
task. 

The findings, while generally nonsignificant, 
raise the question as to whether a more com- 
plex task or a larger number of trials might not 
yield differential practice effects not disclosed 
in performance on these simple tasks over a 
limited number of trials. The results are inter- 
preted as providing no support for the conten- 
tion of some clinical observers that brain- 
injured patients are likely to show fatigue 
effects after the first few trials in simple or 
choice reaction-time tasks. 
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A TEST OF THE TAYLOR-SPENCE THEORY OF ANXIETY'” 
ELI SALTZ ann ARTHUR J. HOEHN 


fer Force Personnel and Training Research Center, Chanute Air Force Base, Illinot 


‘Ee of the most timulating and pro number of erroneous responses which, through 
vocative deve opments in recent psy the operation ol chance fluctuations in re 
hology has been the attempt by sponse strengths, interfere with the correct 
laylor and other Spence students to relate a response. Since anxious 5s are assumed, after 
clinically oriented definition of anxiety to Miller (6), Mowrer (8), and Taylor (10), to 
modern behavior theory. Because of the po have higher D than nonanxious Ss, anxious Ss 
tential significance of this development, it 1s should perform more poorly than nonanxious 
important that the laylor Spence theory be Ss in competing response Situations 
rivyorously tested. Unfortunately, some of the Many studies have been performed that 
early experiments cited in support of the theory seem to support the Taylor-Spence theory (1, 
were nadequately controlled he present 7, 9, 11). However, in all these studies an in 
experimen! vere designed to evaluate the crease In response competition 1s act ompanied 
Taylor vence theory under conditions which and confounded by an increase in difficulty 


take into account some of the factors ignored — level. In fact, at times difficulty seems to be 


in the early studic used as an index of competition, as though no 
laylor (10) originally conceptualized mani other sources of difficulty exist. For example, in 
fest anxiet is a drive variable. Her theory one paper Farber and Spence state, \ further 
lollowed the ynventional Hu lan conceptual impli ation of the present theory was that the 
ition of response strength (2) as a multiplica advantage of the nonanxious over the anxious 
tive function of a learning factor (//) and a group would be positively re lated to the level 
drive tactor Dp since anxiou S were theo of difficulty of the individual choice points lof 
rized as operating at a higher drive level than a finger maze], on the assumption that relative 
nonanxiou the pred tion was made that difficulty (number of errors) at a choice point 
more anxious. vould learn faster than would reflects the strength ol competing incorrect 
( inxious one I} theory worked out well response tendencies at that point”’ 1, p 122 
rT imple conditioning experiment anxiety Because of the confounding of difficulty and 
vas found to fa tate the acqu tion of con competition in these studies, any theory based 
ditioned re ponse 10 The theory proved on tne hypothesis tha anxious OS periorm 
inadequate, however, for more complex verbal more poorly than nonanxious Ss as a function 


sk would be upheld by 


earning. Montague (7) appears to have been of the difficulty of the t 
thre mirst to 1s the | ivior cale in a verbal the same data wi ich have been used to date as 


earning study. He reported ittle evidence to evidence for the competing response theory 


indicate that anxiety facilitates verbal learning rhe pre sent studies attempt to control both 
In some conditior he found that anxiety competition and difficulty so as to determine 
uctually interfered wit! ich learning. As a thei respective effects In the first experiment, 
consequence of the Montague study, Taylor competing and noncompeting verbal learning 
and Spence 11) have recently elaborated the materials were selected which had empirically 
earlier theory been determined to have equal difficulty levels 
The revised Taylor Spence theory holds that lor i group ol nonanxious The prediction, 

In a competing response tuation, increased on the basis of the Taylor Spence: theory, was 
drive hould increase the number of re ponse that the anxious should do more poorly on 
hose rengtl ure iprathreshold. Conse the competing material than on the noncom 
uenwy ere should be an increase in the _ peting, since the increased drive of tie anxious 
This resea , ' t under the Air Force Ss should increase the strength of competing, 
Personnel and ‘Training Research Center, Chanute Air erroneous responses. The second experiment 
Force Base, | attempted to compare the periormance ol 
| avd per . Apt, ; Po hi — gory h anxious Ss on easy but competing materia! 
f " i I I f gg t nee t with their periormance on difficult but non 
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competing material. The predic tion from the 
laylor-Spence theory was that anxious Ss 
should learn faster (relative to nonanxious Ss) 
when competition is reduced, even though 


difficulty is increased 


EXPERIMENT | 
Method 





V aterial Ihe use of nonsense sy! ables permits the 
ndependent 1 julation of response competition and 
the difhcuity level of materia joth intralist competi 
tion and familiarity of the syllables contribute to difh 
cult in learning @ list; greater competition iicreases 
ain t £ ALer i arity 4 micuit (on 
S¢ r l Vas possibie to construct a list of familiar 
S able with a grea eal ol tralist co etitiveness 
whic | ve t ea lifficult i imx group 
i i it la ir t ith Little tra t competitive 
Phe etir t i mstructed trom Glaze 
4 bie | association Vv ie An att pt i wie 
t LxImiz ym pe ) l the l va ] 
The f t lett each r q rom the f t hall ot 
the alphabet The Le tte is used as middle 
tte eact ra Three lett each repeated 
three t e ust is the fu letters of ti 
List variou eveis ol G e€ association value 
were t i t was I that, ft 10 i 
er I balanced as to the or r ol prese 
tatior tt ist it ‘ peting t of 13 per 
t ‘ t isa cult as the « eul 
esponse list ribed a rhe meat r of 
t i »a ri I ne periect recitat iS ler 
' } igh 4 rst deci ace r the tw ‘ 
I} mp ist wa ( ructed so that 
the had the ime ett i t 
I the ame ist iett Letter ere the 
entire inge ea habet 
Procedure. G la e teste he 
la 4 i t per a er 0) px t ere 
selected { ‘ the « erime Cor f 22 0ra , 
detined the a | re 1sort d 
the rt anx These cut gz yx t ire a t 
ent i I 1 I ul ! il 
t lowa 
I i ea ( 
‘ ‘ Able t! ng l er 
The peti t re eacl 
earn " er 1 \ 
i ‘ ne re ‘ f I t the 
( ‘ 4 ( et 1% i 
earne ne e ur 
the ‘ 


Result 
Che prediction, based or tne lay or-sSpence 
theory was that anxiou 5 hould do more 


poorly on materia rt n response 0 npeti ion 


rABLE 1 


MEAN NuMBER OF TRIALS TO CRITERION FOR ANXIOUS 
AND NONANxIOUS SUBJECTS ON “Easy’’ MATERIAI 


Learning Material Anxious Non 
anxious 


Noncompeting 13% Mean $1.09 27.83 
association ma SD 13.59 14.64 
terial N 45 18 

Competing 90°) as Mean 28.09 28.44 
sociation material SD 17.54 12.18 


V 45 18 


than on material low in response competition 


if the competing and noncompeting materia!s 
were of equal difficulty for nonanxious Ss. The 
results do not sustain this prediction. Table 1 
shows the mean learning rates. With 45 
anxious Ss in each group, f 907 for the 
difference between competing and noncom 
peting lists; thus the null hypothesis can not 
be re jer ted. In fact, the difference is not even 
in the direction predic ted 

The original equating of the competing and 
noncompeting lists lor nonanxious SS Was cross 
validated with two new groups ol nonanxious 
Ss. The results for these groups, summarized 


in Table 1, indicate that the initial equating of 


list was itistactory The nonanxiou groups 
had a difference between means of .61 trials to 
riterion: f 136 with 34 de grees ol freedom 
EXPERIMENT I] 
VM ethod 
V aterial iz tl ‘ ( ent it wa ece il to 
ne it mmpet it i no iit than 
the « ‘ t | iment I the 
eting list in Expe . 2 is as dith t asa 
eting ut the 13] ent Glaze association 
it ‘ tl lM iit n if pet I iti thi t peri ent 
ted f at the zero lation 
‘ I | r t this | NET tical 
ith those for « truct the 13 per cent a lation 
leve competi t \ is the case in Experi 
ment | thi t ce t ) i é er erie 
Procedure ] nteen inx i i r U On 
4 ore ft the dith t 
t Phese ra ut ra é 
ir | i ca | 1 | | ] ri 
I as Il were run j tantia ta ) 
The ‘ ré t resent t " al 


Result 


The scores obtained by the 1S anxiou ind 


1% nonanxious Ss on the competing response 
list of Experiment I were used as a ba for 
evaluating the resu of Experiment II. The 
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pre diction from the Taylor Pe vence theory was 
that the anxious Ss should do relatively better 


(compared with the nonanxious Ss) on difh 


cult, noncompeting material than on easy, 


competing material. The results are contrary 


to thi predic tion. As can be seen in I ig. 1, the 
deterioration of performance between the easy 
but competing and the difficult, noncompeting 
greater for the 


The difference 


muct anxious 


material was 
than for the 
between these mean differences is 14.50 trials 


nonanxious 35s 


to criterion, significant at beyond the .05 level 


with ¢ 2.05 and 95 degrees of freedom 


JD ISCUSSION AND (CONCLUSIONS 


Neither Experiment I nor II sustains pre 
the the 
These experiments suggest that 


dictions made on basis of Taylor 
open e theory 
the results of previous studies which are often 
cited as support for the Taylor-Spence theory 
may be artifacts due to lack of appropriate con 
trol over the difficulty levels of competing and 
noncompeting materials. In Experiment I, the 
performance of anxious Ss was not worse on 
competing than on noncompeting materials, 
these materials having been equated on difh 
While the logic of 
the 


hypothesis, the indication that the anxious Ss 


cuity for nonanxious Ss 


statistics never permits proof of null 
perform no worse on the competing than on the 
noncompeting material is very convincing. The 


Evt SALTZ AND Artuur J. Hoenn 


V’s are large (45 anxious Ss per group), and 
the difference is even in the direction 
predicted by the Taylor-Spence theory. 

In Experiment II, the results are also op- 
posite in direction from those predicted on the 
basis of the Taylor-Spence theory, and this 
difference is statistically significant. A problem 
arises in connection with Experiment II, how- 
ever. The noncompeting list of this experiment 
was assumed to be more difficult than the 
competing list of Experiment I. This assump- 
tion was made on the ground that the com- 


not 


peting list was as difficult as a noncompeting 
list of 13 per cent Glaze association level ; conse- 
quently, a noncompeting list of zero per cent 
should be more difficult than either the com- 
peting list or the noncompeting list at 13 per 
cent. As can be seen in Fig. 1, both anxious and 
nonanxious Ss do more poorly on the zero per 
cent noncompeting list than on the competing 
list, but this difference is not significant for the 
nonanxious S, though it is significant beyond 
the .01 level for the anxious Ss. Even if one 
assumes, however, that the noncompeting zero 
per cent list and the competing list are of equal 
difficulty, the results are contrary to pre- 
dictions made from the Taylor-Spence theory. 
From these notions, one would expect the 
anxious Ss to do relatively more poorly on the 
competing material than on the noncompeting. 

These findings give no conclusive idea as to 
difficulty affects the performance of 
anxious ‘Ss. However, some indication of how 
difficulty acts is suggested by other recent 


hou 


studies. Lucas (4) reported that the perform- 
ance of anxious Ss deteriorates relative to the 
performance of nonanxious Ss when both 
groups are told that they are doing poorly on a 
task. Difficult may trigger actual 
anxiety in manifestly anxious Ss, and this 


material 


might somehow produce the decrement in per- 
formance. (It should be pointed out that the 
Tayler scale measures susceptibility toward 
anxiety, not necessarily the actual presence of 
anxiety in Ss. This point may be important 
for the interpretation of results obtained with 
the scale.) 

Some recent research relating anxiety and 
intelligence appears to support the general 
conception presented here. Mayzner, Sersen, 
and Tresselt (5), reviewing the literature on 
the relationship between anxiety and intel- 
ligence, found that reported correlations ranged 
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between +.19 and —.40. They presented evi- 
dence which suggests that no correlation be 
tween anxiety and intelligence occurs when 
the intelligence test is taken under nonstress 
conditions. These writers hold that negative 
correlations between anxiety and intelligence, 
such as those reported by Grice (3), may be due 

the detrimental effect of stress on anxiety 
in the test-taking situation 

The above reasoning would suggest that in 
creased difficulty might not be associated with 
relative decrements in performance by anxious 
Ss if (a) Ss were reassured of their good per 
formance, or (6) Ss were in a state of stress 


while performing on the easy material 


SUMMARY 


Previous studies, intended as tests of the 
Taylor-Spence response-competition theory 
concerning the effect of anxiety on perform 
ance, have confounded amount of competition 
and the difficulty level of the material. The 
present studies tested the Taylor Spence 
theory under two conditions. (a) Competing 
and noncompeting materials were equated in 
difficulty fora nonanxiousgroup. Theprediction 
from the Taylor-Spence theory that the anxious 
Ss would perform more poorly on the com 
peting than on the noncompeting material was 
not sustained. (6) Anxious ana nonanxious Ss 
were tested on easy competing and difficult 
noncompeting materials. The prediction from 
the Tcylor-Spence theory was that anxious Ss 
would do more poorly (relative to nonanxious 
Ss) on the easy competing than on the difficult 


noncompeting material. The results were 
opposite from those predic ted and Significant 
at beyond the .05 level. 

rhe results were interpreted as not support 
ing the Taylor-Spence theory 
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BREADTH OF DEVIATE CONCEPTS USED BY SCHIZOPHRENICS! 


LOREN J. CHAPMAN? 


N altempting toa ount lor the intenority 


0} schizophreni criorming col eptual 


tasks, one hypothesis that has been ad 
vanced, most clearly by Cameron 2 1s 
that hizophrenics are able to form concepts 
quite readily but are prevented from exhibiting 


’ 


idequate performance by their overresponsive 


ness to inappropriate stimuli, both internal and 

exter! lwo recent studies by one of the 

present vriter have lent upport to this 

hypothe 3, 4). In these experiments, Ss were 

re i re | to ort card by Common filial ities 

al rure on differe ird there being 

ommunalities between figures that 

thie I ould be orted in more than one 

| thie nstructions Kd ignated the 

( ommunalities for sorting, thus mak 

‘ iSK One ot following nstruction rather 

{ discovering the corre oncept. The 

( muna ( prov ded d ra ng op 
lor | I In Incorre Vay It Va 

ophre h more than 

I tended to ignore rhe nstruction 
1lo rt by these distracting mmuna Y 

f ! the number of these distracters in 

he ileria ncreased chizophren per 

e became | reasingly inferior to that 

eve though the tasks remained 

‘ ‘ i thie imme Th distractibility Vas 

le ! tt ne the ba of the distracter 

0 natit Wa lentity of the figures « the 

Vil ird ind when the f ire posse ed 

! natity by virtue ol i on ept la 

ile ry 

lL hese tudi ho ed that in one Ly} ol 

conceptu orl r task, at least part of the 

hizophren inferior perlormance = arises 
ire yverresponsivent 38 to Inappropriate com 

munalities, 1.e., communalities other than the 

instructed one Che pre el tudy investigates 
The autl i enpte t 1) Ju in H. Path 

i i George Fa I It i ( VAH ita 

for t t taining |t L hret ) ind 
t I (sa el berrazza i I hard Saavedra o 
the C} i { Pi H ince obta 

ing " 
I} t was do t ‘ hor was a 


| mtcdoctoral re earcl té¢ 


of Mental Healt! 


TAYLOR 


Northwestern University 


interfering overresponsiveness in another kind 
ol conceptual task and seeks to isolate other 
situational conditions under which it might 
occur 

It is commonly observed that s« hizophreni s 
tend to be “overinclusive”’ on 
tasks, 1.e 
category many instances which are 
that 


related to or 


concept ual 


sorting they include in a given 


incorrect. 
stated these in 


In addition, it is often 


correct instances are frequently 
superficially resemble objects which properly 
which they are 


belong to the category in 


placed (5, 6). In the present investigation, 


certain aspects of this latter observation are 


subjected to experimental test. Specifically, it 
is hypothesized that the schizophrenic tend 
ency to overinclude in a sorting task is posi 
tively related to the degree to which incorrect 
items share common qualities with the correct 
items In a given conceptual category 

Che 


follows. Normal and schizophrenic Ss were told 


method of testing this hypothesis was as 


the name of a common conceptual category 


(e.g.. fruits) and were asked to select from a 


group of items names of objects belonging to 
7 


at category. A guide card containing thename 


of an object in the designated category served 
as a reminder of the per iied category and also 


of the place where the correct items were to be 


put. The items from which Ss were required to 
select included, in addition to the correct 
instances, incorrect instances which were 


equally divided between members olf a category 


similar to the designated one (e.g. fruits vs. 
vegetables), and members of a category that 
was markedly different from the correct one 
(e.g., birds). It is predicted that schizo 


phrenics would incorrectly include in the desig- 


nated category more similar items than dis 


similar ones. In contrast, normal Ss, if they 


make overinclusive errors at all, should exhibit 


each type ol error equa ly often 


Should hizophrenics exhibit such a 


the 
preference for overincluding similar items, it 
might be said that their basis for sorting tended 
to be inappropriately broad, being made a 
cording to a wider concept than was specified 
In the that 


instructions terminology 
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will be used here, schizophrenics, when asked 


to sort by a given conceptyal ‘‘species,’’ would 
be sorting act ording to a broader ‘genus,’ tl 
genus including not only the species spe ified 
in the instructions but also a second similar but 
incorrect species In the example above, the 
genus of edibles, which included both fruits 
and vegetables, would be inappropriately sub 
stituted for the designated spec ies ol fruits 

In addition to being overinclusive, schizo 
phrenics are also observed to be overexclusive 
in their sorting, i.e., to exclude from a category 


7 


instances that properly belong to it In at 
tempting to account for this type of error, it 1s 
possible that such overexclusion results from 
sorting by a more narrow category than is 
appropriate In the present study an attempt 
vas also made to test this suggestion, adding a 
further hypothesis concerning the conditions 
under which such narrowing might occur. A 
summary of the actual experimental procedure 
might best serve to introduce these hypotheses 
Using the same task materials as in the previ 
ously discussed sortings, E told S the name of a 
conceptual category that was, in the termi 
nology used above, a ‘“‘genus”’ and asked S to 
select from a group of items names of objects 
belonging to that category. The correct items 
available for sorting included equal numbers ot 
members of two constituent species categorie 

he incorrect items were, as before, members 
of a category that was dissimilar from the genus 
or its two species categories. A guide card with 
the name of an obje t from the genus category 
was used to indicate the place for the genus 
card Naturally, the obje t on the guide ¢ ird 


, ' : 
could belong to only one ol the two species 


groups It was expected that WU errors ol ex 


clusion in schizophrenics are due to excessive 


narrowing of the conceptual basis ol sorting, 
such narrowing would be most likely to occur in 
the situation in which a broad category, 
reducible to smatier ones, Vas employed 
Further, in line with the notion that schizo 


phrenics are highly distracted by external 


stimuli present in the situation, the direction ol 


‘ ' 


narrowing might be toward sorting according 


to the species to which the guide card belor ged 
Chus, schizophrenics were expe ted to exclude 


incorrectly from the genus sorting lewer iten 


in the species category to which the guide card 


belonged than items in the species Ca 


which it did not. Again it was anticipated tha 


LHe 


such errors as normals might make would 
reveal no differential tendency toward the two 
types of overexclusion. 

In one of the previously discussed studies of 
distractibility by Chapman (3), the criteria 
for choosing Ss were such that only moderately 
impaired schizophrenics could be used. None 
theless, when the group was divided into two 
groups according to severity of illness, signifi 
cant differences were found, the milder cases 
exhibiting fewer errors. The difficulty level of 
the tasks in the present study was such that 
more severe cases could be accepted. In order 
to determine whether degree of illness could 
again be related to the amount of periormance 
impairment, two groups ol schizophreni SS 
were tested, one judged to be relatively severe 
in degree of impairment and the other, com 
parable to the total group previously studied, 


moderately impaired 


METHOD 


Subjects. One normal and two schizophrenic groups, 


each consisting of 24 males, were tested. The normal 


3s were drawn from the general medical and surgical 


patients at the USPHS Hospital, ¢ hicago, Illinois. No 


idual was included in this group who appeared 


may 


by hospital record or by interview to have a psychiatric 


or neurological disorder The schizophreni groups, 
patients in the Downey VA Hospital, had all received 
1 hospital diagnosis of chronic schizophrenia, with 


their first hospitalizat on tor p ychosi occurring at 


least one year previously and their last at least 3 
mont! previo Other prerequisite lor ection 
were absence oO! a dication of cerebral organic in 
y ement or mental deficiency, no current drug ther 


no EST within the past 3 months, or more than 


35 EST treatments altogether. The two schizophrenic 


gro vere distinguished by their ward status, the 
moderately incapacitated gryup being draw irom 
patient with ground privilege or assignmel! to an 
ut ve deta ancl ine evere incapacitated 
group being draw t itient th neither ground 
| eve nor jutsic \ i nment | i tion 
to Ul 4 in eact the chizophrenic gro 1 
wierate and 18 severe chizophreni ( Liso 
een but were not included in the final grou; hy 1us¢ 
the ver unwilling inabie to participat the 
oxi « ta proce re 
Mea ige is 3/ years for eact i the tl 
While ca ear ( icatio ere 10.4 for tl “I 
erate schizo eni 10.6 r the severe a 9% lor 
he " i Althoug cf i] hid yt er ‘ 
ficant th respect to ese two i ‘ t} 
nhizopl c gro C ‘ hea owe ft 
t g lO a te en y the CY presuma 
ecaus f the « tive efiect I hizophreni 
it} 0 test. the in 1O us 107 the 
£2 wr th ate = yphret 9) 
the 4 ‘ ‘ 
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rABLE 1 
Tue CaTecories anp Guipe Carp Usep in Eacn Task, AND THE Names USED IN THE 
INSTRUCTIONS FOR THE CONCEPTUAL SPECIES AND GENUS 
ask Categories Guide Name for Species Category Name for Genus 
Card Category 

A kitchen utensils pan kitchen utensils, i.e., utensils used for | things found in a 
furniture cooking or eating house 
birds 

B alcoholic beverages wine alcoholic drinks, i.e., drinks that if you | things that a person 
nonalcoholic beverages drink enough of they make you tight can drink 
insects 

( trees oak names of trees growing things 
flowers 
musical instrument 

D fruit apple fruit things that a person 
vegetables can eat 
tools 

I domesticated animals cow domesticated animals, i.e., animals that | animals 
wild animals are kept on the farm or in the home 
recreational equipment 

I wearing apparel skirt wearing apparel things made of cloth 
household linens 
sea creatures 

Materials. Six sorting tasks were devised, each cor a fruit goes in this slot beneath the apple. Every 


sisting of 30 words typed on separate 3” 5” index 
card The words were equally divided among three 
conceptual categories, two ol which were designed to 
be similar but not overlapping (2.2... alcoholic and 
nonalcoholic beverages) and the third markedly differ 


Each of the two 


while the 


ent from the other two (e.g., insects 
here called a “snecies”’ 


rhe third, different, 
Before the 


Similiar ategories 1 


two together are called a “genu 


xt of words is considered as control items 


experiment proper, the materials were given to a 


background to those 
that for 


number of normal similar in 


used in the experiment, in order to insure 


known and could readily 


such the items used were 
be assigned to the proper category The actual cate 
gories used in each of the tasks are listed in Table 1 


A 13” 10” * 4” box was used for sorting the cards, 


the box having two slots in the top through which the 


cards could he deposited A wire rack for holding a 
guide ard was placed on top of the box in back of 
one of the ot 

Procedure. Fach S was required to sort three of the 


i conceptual species, i.¢€., to 


ix task cording to 
separate the designated species cards from the second 
species and the dissimilar control items. The remaining 
three tash were sorted according to a conceptual 
genus eparation of the genus category, made up 
ot the twe pec . from the control items. Each of the 
ix tash could be used for either type of sorting, with 
miy the 1 tructior distinguishing the two 

For both types « worting, / handed S the 30 cards 
of that task amed the category of cards that were to 
be separated out from the others and placed a guice 
car contal g a word illustrative of the pecified 
category on the rack in back of the slot in which cards 
ol that category were to ee placed. Instructions given 
on each task iv be illustrated | the following, taken 

fre the speci lirections for Task ID (See Table 1 
Phi ard up here " upple.”” An apple is a 
! Take t ‘ il ind each one that names 


thing else, that is, everything that is not a fruit 
goes in this slot here 

Except for the name of the particular category and the 
guide card used, these same instructions were used for 
all tasks on both types of sorting. The specific designa 
tions and the guide cards employed for both species 
and genus sortings are listed in Table 1 

Each of the six sorting tasks was used for species 
sorting with half of the Ss in each group and for genus 
sorting with the remaining. The order of presentation 
of the two types of sortings was of the abbaab pattern 
for each S, half of the Ss beginning with the species 
sort. The position of the guide card on the box was also 
varied for each S, alternating from right to left on suc 
Finally, the particular order of the six 
randomized among Ss 


cessive tasks 


tasks was systematically 


RESULTS 


Species sorting. The distributions of all types 
of errors were extremely skewed, especially for 
the schizophrenic groups. For this reason, both 
mean and median error scores were computed 
for each group and all statistical tests were of 
the nonparametric type 

As predicted, the schizophrenics incorrectly 
placed in the designated species category more 
of the similar species items than dissimilar 
words, while the normals made few 
(More specifically, as 


control 
errors of either 
indicated by the median error scores shown in 
Table 2, the severe schizophrenics made 6 out 
of a possible 30 incorrect placements of the 
the moderate group 6, 


type 


similar species items, 
and the normal Ss a median of zero errors. 


When compared by means of a Mann-Whitney 





mn 
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test, highly significant differences were found 
between schizophrenic and normal Ss, the 
comparison between the normal and moderate 
groups yielding a z of 2.76 (p < .01) and the 
comparison of normals and the severe schizo- 


phrenics a z of 2.78 (p < .01). No significant 
difference was found, however, between the 
two schizophrenic groups (p = .43). In con- 


trast to these normal-schizophrenic differences 
for the similar items, the median number of 
errors of placement of the dissimilar cards was 
zero for all three groups and no significant 
differences were found among the groups 

One additional comparison was made in 
order to obtain a more direct measure of the 
tendency of the schizophrenics to overinclude 
differentially on the basis of similarity. For 
each S, the number of incorrectly placed dis- 
similar items was subtracted from the number 
of incorrectly placed similar items. As may be 
seen in Table 2, the median value of this differ 
ence score is zero for the normal group but 
much higher for the schizophrenic groups 
(Examination of the data reveals that the 
lower difference score for the severe schizo 
phrenics, as compared to the moderate group, 
is largely due to the tendency of the severe Ss 
to make more errors of both types, thus lowering 
the magnitude of the difference between the 
two types of errors.) Comparisons of the groups 
on these difference scores by means of a Mann- 
Whitney test yielded a z value of 4.10 (p < 
.001) between normals and moderate schizo 
phrenics and of 3.44 (p < .001) between 
normals and the severe group. Again, the two 
schizophrenic groups did not differ significantly 
(p = .74), both showing an equal tendency to 
include incorrectly similar items. Inspection of 
the errors on each of the six tasks separately 
revealed that this differential error tendency of 


rABLE 2 


MEDIAN AND MEAN NuMBER OF ERRORS ON THE 
SPECIES SORTINGS 


Inclusion 


Exclu 
(,roup = ae 

, . Dis Diff . 

imiUar) similar D 

Norma! Mdn. | 0 0 0 0 
M 54 33 21 58 
Schizophrenic; Mdn. | 6.00 | 0 6.00 1.50 
moderate M 7.33 29 | 7.04 | 3.04 
Schizophrenic; Mdn. 6.00 | O 3.50 1.00 
severe M 8.20 1.33 6.87 | 2.67 


the schizophrenics occurred on each of the 
tasks. 

The schizophrenic groups differed from the 
normal group, it will be remembered, with 
respect to IQ scores, even though they were 
matched on education. Therefore the observed 
tendency of the schizophrenics to overinclude 
items differentially on the basis of similarity 
might be attributed to a lowering of function 
ing intelligence. In order to evaluate this possi 
bility, 19 normal and 24 schizophrenic Ss were 
selected from the total groups in such a way 
that the mean and SD of the IQ scores for the 
two groups did not differ significantly. The 
mean IQ of these selected normal Ss was 105, 
and of the schizophrenics, was 104. When these 
groups were compared, the median score for 
the difference between the dissimilar and 
similar types of error was 0 for the normal and 
2 for the schizophrenic Ss, while the means for 
the two groups were .1 and 4.8 respectively. 
The difference between the two groups, tested 
by a Mann-Whitney test, was significant at the 
001 level of confidence, thus indicating that 
the tendency to overinclude is not accounted 
for by differences in functioning intelligence, 

As a matter of incidental interest, attention 
might be called to a different type of error 
than those mentioned above: the number of 
correct species cards excluded from the desig- 
nated category. From Table 2 it may be seen 
that the discrepancy between thenormal group 
and each of the schizophrenic groups on this 
score was not as striking as on the over 
inclusion measures, but in both cases the differ 
ence was significant at the .01 level. 

Genus sorling. In the genus sorting, it will be 
recalled, correct sorting consisted of placing 
with the guide card all the members oi a con 
ceptual genus which consisted of two concep 
tual species. It was expected that schizo 
phrenics might exhibit a differential tendency 
to exclude items from the designated genus 
category, i.e., to omit fewer of the items falling 
into the same species category as the guide card 
than items belonging to the second correct 
species. Contrary to expectation, however, the 
frequency of errors of exclusion was not related 
to the similarity of the items to the guide card 
Although, as seen in Table 3, both schizo 
phrenic groups excluded a few less of the 
species category to which the guide card be 


longed, the difference in frequency of the two 
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rABLE 3 


MEDIAN AND MEAN NomBer OF Errors or INCLUSION 
AND | Lt or Guipe-Carp AND Non-Guipt 
CARD BCATEGORIE FOR GENT SORTINGS 

Ex 
Grou; Ir " 
r. ( le Diff : 
Mdn. 0 0 0 0 
M 2? 25 00 21 

chizovhrer Mdn. 0 0 0 0 
moderate M 44 1.12 29 1.42 
} } Madr 0 1.00 0 0 
‘ M 1.96 > SO 1 2.00 


types ol error, examines a Wilcoxon te 


did not approaci gnificance for any of the 
group 

Errors of any type were, in fact, relatively 
infrequent in all of the groups on this genus 
orting, most of the medians being zero 
Further, when the groups were compared on 
each of the two types of errors of exclusion and 
on the errors of incorrectly including items from 
the third, incorrect category, none of the groups 
differed gnificantly These figures, however, 
eem overestimate slightly the goodness of 
the hizophrenics’ performance. When groups 


were cor pared on total errors ot ex lusion, the 


severe schizophrenic group was found, by the 
Mann-Whitney test, to exceed the normal 
group significantly (p 02). No other group 
comparison on this measure was significant, 


howeve! This general tendency toward ex 


clusion that ippeared im at least the severely 
incapacitated schizophrenics was paralleled, it 
will be recalled, by a similar tendency in the 
pec orting in which both schizophrenic 
groups significantly exceeded the normal Ss in 
their errors of exclusion. Thus, differences were 
found between normal and schizophreni 
groups on overexclusion measures although the 
ba lor tl behavior has not been isolated 
The ightne of the differences between 


groups on the genus sorting, as compared with 


s Sorting, suggested 


the differences on the pec 


that the genu isk is |e difficult for the 
chizophrenics than the species task. In order 
to check this more dire y, the total number of 
errors made in ea h group on the two types of 


ta } s were compared by a Wil oxon test Both 


found to make more 


zopnreni vroups were 
errors ol pec orting than on genus sorting 
lor the moderately incapacitated, the value 


(CHAPMAN A 


ND JANET A 


TAYLOR 
was 4.04 (p 001) and for the severe, the z 
was 2.83 (p . Ol 
ever, there 


(p 73) 


For the normal Ss, how- 


was not a significant difference 


DISCUSSION 


The analysis of errors of inclusion in the 
species sortings confirmed the hypothesis that 
schizophrenics would incorrectly include more 
items from the category similar to the correct 
In at- 


be- 


one than from the dissimilar category 
the 
it might be said that the 


tempting to describe overinclusive 


havior of these Ss, 
schizophrenics continued to perform concep- 
tually but according to a broader category than 
was specified by the instructions. These results 
thus add further support to the contention that 
schizophrenk inferiority on conceptual tasks 
is due to overresponsiveness to inappropriate 
stimuli rather than loss of conceptual ability 
per se 

Ihe hypothesis that on the genus sortings 
schizophrenics would show the opposite re- 
action-—excessive narrowing of the conceptual 
basis for sorting received no support. Schizo 
phrenics made more errors of exclusion than 
did normals (although only the severe group 
differed significantly from normal Ss), but the 
attempt to isolate stimulus conditions affecting 
these errors was unsuccessful 

One of the interesting results of the genus 
sorting in general was the fact that schizo- 
phrenics made so few errors of any type, in 
clusive or exclusive. In a sense the comparative 
adequacy of the schizophrenics’ performance 
the the 


sorting: when faced with separating categories 


further confirms findings of species 


are dissimilar (as in the genus 
differ 
when the correct items are 
the 


of items that 
tasks), 
little from normals; 


schizophrenics comparatively 


relatively similar to some of incorrect, 


schizophrenic performance suffers although 


that of normals does not. Related to this is the 
unexpected finding that schizophrenics showed 
a more pronounced tendency to exclude correct 
items from the designated category in the 


\ possible 


reason for this is that in the presence of stimuli 


species than in the genus sortings 


that act strongly as distracters, schizophrenics 
become more susceptible to distraction by all 
sorts of other stimuli, internal and external, 
which under other circumstances do not have 
much influence 


Finally, a comparison can be made of the 
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two schizophrenic groups. The severely in 
capacitated group tended to make more errors 
of all types on both kinds of tasks than the 
moderately incapacitated and included more Ss 
wit h high error scores, aS Can be seen esper lally 
from the mean error values shown in Tables 2 
and 3. Although the difference between groups 
in no case reached an acceptable significance 
level, there was, at least, a consistent direction 
of difference, the severely incapacitated JSS 
tending toward a more inferior performance 


SUMMARY 


[he present study attempted to isolate 
stimulus conditions that result in certain types 
of errors being made by s hizophrenics on a 
conceptual sorting task. Normal and schizo 
phreni groups were asked io sort words 
naming objects falling into three conceptual 
categories, two of which were similar to one 
another, and the third dissimilar. In the first 
part ol the study Ss were asked to separate 
items belonging to one of the similar categories 
from the remaining items. It was expected that 
for the schizophrenics, but not the normal Ss, 
the items incorrectly included would more 
often be from the similar incorrect category 
than from the incorrect dissimilar one. The 
results confirmed this expectation, schizo 
phrenics differing significantly from normal Ss 
on a score of number of incorrect similar items 
included minus number of incorrect dissimilar 
items. These errors of the schizophrenics there 
fore may be described as resulting from an in- 
appropriate broadening of the conceptual basis 
ol sorting 

An attempt was also made, in the second 
part ol the study, to determine whether an 
opposite kind of error, excessive narrowing of 


the conceptual basis for sorting, might also take 


place under certain circumstances. The groups 
were again given items falling into three cate 
gories but the designated category which they 
were asked to separate from the other items 
consisted of the two similar subcategories. It 
was expected that schizophrenics, unlike 
normal Ss, might tend to sort according to only 
one of these subs ategories, thus ex luding in 
correctly more of one than the other. This 
prediction was not confirmed, schizophrenics 
showing little difference in performance from 
normal Ss in these sortings 

A comparison was also made of the per 
formance of severely and moderately incapaci 
tated schizophrenics. Although the severe 
group was in no instance significantly different 
from the moderately incapacitated, the di 
rection of difference between the groups 
pointed quite consistently to the relative in 


feriority of the severe group 
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CRITIQUE AND NOTES 


EFFECT OI 


PREINFORMATION UPON HUMAN CHARACTERISTICS ATTRIBUTED 


TO ANIMATED GEOMETRIC FIGURES'! 


RONALD E 


SHOR 


Brandeis University 


HE present study attempts to demonstrate 
that the impression that a person forms of 


another individual’s personality is partially deter 


mined by the characteristics of the other people 
with whom he wen to interact. Following the 
work of Asch (1) and others (2, 3, 5, 6, 7, 8, 9), 


who have demonstrated that preinformation about 
an individual’s personality affects first impressions 


of him, this study attempts to show that such pre 


information also affects impressions of the per 
sonalities of others who interact with him. Ani 
mated geometrical figures were used instead of 


actual person 
In the Heider-Simmel movie 


the stimulus material, three 


(4) which provided 
geometric figures (a 


triangle, 7, a small triangle, ¢, and a small 


about and interact in such a way 


large 
circle, ¢) move 
as to lead most subjects (Ss) to report the action 
as that of story 
Most S 
‘nasty 
attempts to take advantage of c; ¢ is seen as a 


three persons” acting ou! ¢ 
pontaneou ly perceive T as ani peTessive, 


male who picks a fight with ¢ and who 


pirited, who attempts to protect « 


heroic male 


from 7; c is seen as a fearful, defenseless female 
who seeks the protection of f 

It was hypothesized that if Ss 
ferent preinformation about the pe rsonality of one 
of the figures (7 


the other two figures are perceived would also be 


were given dif 


was chosen) the way in which 
affected. Since Ss tend spontaneously to perceive 
T unfavorably it was hypothesized that giving 
them unfavorable preinformation about T would 


either have no additional effect or would tend to 


accentuate slightly the way in which they spon 
taneou perceive the three figures. That is, if 
there were any changes, TJ would be seen as 


braver, and ¢ as more defenseless 


it was hypothesized that favor 


nastier,” ta 
On the other hand 
able preinformation about T would cause a con 
Ss would 


siderable change in interpretation. The 


perceive T more favorably and perceive ¢ and « 


less favorably 


MetTHOD 


In a pilot study 34 members of an undergraduate 


psychology course at the University of Kansas were 


‘This report is based on a dissertation submitted 
in May, 1955, in partial fulfillment of the require 
ments for the M.A. degree at the University of Kansas, 


under Dr. Fritz Heider, thesis advisor 
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shown the Heider-Simmel movie twice and asked to 
describe the personalities of the three figures. Before 
seeing the movie, however, half of the Ss were told 
that 7 was an aggressive person disliked by 
people, and the other half were told that 7 was a fair 
minded person liked by most people. From the Ss’ 
reports the investigator listed all of the trait-names 
used to describe one or more of the three figures. This 
list was then reduced to 40 trait-names on the basis 
of frequency of report and diversity of coverage 

In the present experiment, 300 students from an 
introductory psychology course were shown the Heider 
Simmel movie twice and then asked to check for each 
of the figures those of the 40 trait-names which were 
felt to be relevant. The conditions were identical for 
all Ss with the exception that 100 Ss received the 
aggressive-disliked preinformation about 7’s 
ality (AD group), 100 others received the fair-minded 
and liked preinformation (FL group), and 100 more 
received no preinformation, for purposes of control 
(Cn group) 

The instructions for the AD group were as follows 
“We are going to show you a movie in which there are 
three geometric forms; a large triangle, a small tri 
angle, and a circle, moving in various ways in and 
Please imagine that the two 


most 


person 


around a larger square 
triangles and the circle are people acting out a story. 
We will show you the movie twice and then ask you 
to write out the answers to a number of very simple 
questions about the movie. In no way is this a test of 
your intelligence or personality, and there is no time 
limit. The only bit of information about the 
which we are giving you is that the large triangle is 
an aggressive person who is disliked by most people 
Remember this fact and use it to help understand and 
interpret the story: the large triangle is an aggressive 
person who is disliked by most people.” 

Instructions for the FL group were identical except 
that the words fair-minded and liked were substituted 
for the words aggressive and disliked. In the Cn group 
the last two paragraphs were deleted 


movie 


RESULTS 


In Table 1, the trait-names are arranged a 
cording to the size and direction of difference be 
tween the AD and FL groups in the frequency of 
their choice as de scriptions for T. Since the size 
of each group is 100, these frequencies are also 
percentages. To illustrate, 92 Ss in the AD group 


checked the likable” for T in 


comparison to only 24 Ss in the FL group. The 


trait-name “not 
difference of 68 between these two frequencies is 
the largest in which an AD frequency exceeded an 


FL frequency, and thus “not likable’’ is ranked in 
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Lag po 1 ; first place. An asterisk indicates that this difference 
REQUENCIES FOR Eacu Fiat a or TRatt is significant.? The trait-name ranked last is the 
E ASCRIPTION IN THE AD anp FL 
LD Groups one in which the greatest excess of an FL over an 


AD frequency occurred .* 
Figure As can be seen from the table, where more Ss 
in the AD group than in the FL group see T as 


l'rait-names T t r 
possessing a certain trait, a very strong tendency 
AD FL AD FL AD FL exists for fewer AD than FL Ss to see this trait in 
L te the other two figures. The inverse is also true 
y ] ? ) oa > 
fore sm some! oa 24 . - 4 ‘ : +3 ¢ Moreover, differences between AD and FL group 
roublemaker > “ 52 2 . an 
told Bully , o1 45 * » 13° 0 0 frequencies are greatest for 7, fairly large for 4, 
ost Bad person 57 15 * 118*)| 0 6 and small but observable for c. Where the differ 
lair Showofi 68 29° | 13 30* 2 5 ences tend to be large for T they also tend to be 
s’ . rous ; * » * " , 
> l'reacherou 45 11 u 01 0 6 large for t and c, and where they are smaller for 7 
nes Defiant 48 22 21 31 5 8 ; . } hers al I} 
, Boastful 43 21 * 4 4° 0 . they tend to be smaller for the others also 1uUs 
lis ‘ < é 
sale Unsure of self 48 26% | 18 31% 1/155 52 there is a clear and consistent tendency for im 
Aggressive 90 70 * | 23 35 4 5 pressions of ¢ and ¢ to vary together and in an op 
. * . 
= onceited +4 16 re Z ~ = A. posite fashion from impressions of 7 
. owardiy 25 8 2 ) 2 2 
er ¥ oo % 14 12 33 13 19 The trait-names were dichotomized by the in 
cas z -~ 
ich Hot-tempered 97 86* 219*! 0 6 vestigator according to whether they indicated 
ere lricky 23 12 20 38 *| 2 8 favorable or unfavorable impressions of per 
. » ) > 7 
or Fickle 14 8 - 9 ‘ 22 21 sonality. In Table 2, frequencies for the unfavor 
he Curious 15 14 | 32 17 0 31 ‘sagt rea dpes. hich sig 
; Defenseless 2 1 21 24 89 70 * able and favorable trait-names in which significant 
ed Timid 0 O 17 17 82 76 differences between the AD and FL Gruops were 
_ Dependent 13 14 15 17 63 65 found with respect to T are combined to yield a 
re Chel oil 7 7 = 
ol — > ‘7 rH + ; . favorable and unfavorable average frequency for 
' *rsistent ( 69 
( ‘ lit le 16 1 4 8 18 28 each figure for the three experimental groups 
ul ) “- c ~ 
s Female 4 10 9 8 76 72 In the Cn group where the stimulus configura 
re Fun-loving 1 8 32,25 33-23 tions of the movie operate alone without any pre- 
ave bene 7 * , * , 
‘I Minds own busi 2 9 30 é 8 27 information, 7 was viewed in a generally unfavor 
ness 
id eee le 0 8 45 17 * | 67 38 * able light; ¢ and ¢ in a generally favorable light 
% Clever 8 16 41 38 17 10 Ihese findings conform to expectations. In the AD 
y ‘ * ‘c * 
u Se = : - ato 28 ‘ + 7 . group provided with unfavorable preinformation 
onest 0 $5 7 25 
le Laval 0 10* | 59 28 */ 31 29 about 7 these same relations were preserved, and, 
of Good natured 0 13% |44 17° 39 21 * except lor c, were intensified. In the FL group, 
, * * * , 
€ Likable 0 14 63 23 62 40 however, where the preinformation about 7 is 
€ ( ourageous ) 23 . 99 31 - 5 3 Vv he \ ‘ ‘ " | 
Brave 4 24* | 61 33 * 6 3 contrary to the way most os spontaneously per 
Fair-minded emi Ss UMTia as ceive the film, 7 was still more frequently seen as 
* . * 
| Good person 1 24 66 30 65 33 unpleasant than pleasant but many more Ss per 
Has spirit 40 68 * | 78 72 27 «(24 : 
e Dratective 5 33 %1 66 36 * g 7 ceived T as possessing favorable traits and fewer 
Fasygoing except 14 58 * 43 17 * 11 6 perceived him as possessing unfavorable traits 
when provoked than in the AD group. The other two actors were 
| © DiSesence signiicant at 9 also still seen as more pleasant than unpleasant 
? Significance was accepted at the 5 per cent level 
TABLE 2 In most cases, however, probabilities were consider 


ably less than the 1 per cent level 


AVERAGE FREQUENCIES OF FAVORABLE AND 
* Control (Cn) group frequencies are not included 


UNFAVORABLE TRAIT ASCRIPTION FOR 
Eacu Ficur: in this table. In most instances Cn frequencies are in 


between AD and FL frequencies, although much 
closer to the AD frequencies. To save printing costs, 


Figure 
, , a table giving all of the data and the results of tests 
peri 

n par 7 t r of significance for all three experimental groups has 
Group been deposited with the American Documentation 
Fa Un Fav Un Fa Un Institute. Order Document No. 5034 from ADI Auxili 
lav lay fav ary Publications Project, Photoduplication Service, 
: . 3 ra Library of Congress, Washington 25, D. C., remitting 

AD 6 63.5 54 g 7 28 7.6 dvance $1.25 for microfilm « $1.25 { } 
| ' ) 2) Q 7 2 3 9 17.9 11.1 in advan 4 10 n mim or Z or photo 
; =’ ¢ cc 19 8 12.2 2 9 79 copies. Make checks payable to Chief, Photoduplica 


tion Service, Library of Congress 
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figures to real people in everyday | ituations, 

the confirmatory results obtainec nd to indicate 
by th that the impressions formed of an individual may 
Heider be a function of the characteristi iscribed to 
aflected another person seen to interact wit! 


er 
l 
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NE characteristi the hospital ed chizo lems, is ol importance tor per onality theory If 
phrenic is that generally he has been unable for example, the inability of the schizophrenic to 
to achieve OCIALLY acceptable olution to the engage in effective problem solving behavior is 


’ , nd 
problet of dealing h individual limited to problems involving human stimuli and 
Whether tl inability 1 mited to certain classes 


volving human stimuli, 


not other classes of stimuli, it would be difficult to 


account for this fact with a theory of personality 
t solve prol 
_ I ‘ that stressed the importance physiological 
The present study wa conducted while the status and failed to consider adequately the effects 
authors were at Allegheny College, Meadville, Pa., and ol interper onal relations 
the VA Hospital, Marion, Ind., respectively. The rhe present study was undertaken to ascertain 
authors wish to express their appreciation to Mary 
Jane Curry and Marie C. Morane both of whom 
participated in the collection and organization olf the , 
data, and to the staff of the Warren State Hospital havior when the stimuli involved are of two types 


the relative ability of schizophrenic and normal 


ubjects (Ss) to engage in problem-solving be 


Warren, Pa human and nonhuman 








nce 
la 


1h 
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TABLE 1 
MEAN DIFFERENCE BETWEEN NORMALS AND SCHIZOPHRENICS MATCHED ON ONE OF THE TWO CLASSES OF 
STIMULI AND COMPARED ON THE OTHER 


. Mz - , LO- ny dire 
Matching Comparison — N aoa Normal Mean of I oe 
ariabl ariab + : Mes ) ' 
Variable Variable Mean (Pairs Mean Mean Diff Level Difference 
Abstract Human 3.57 45 3 3.78 53 2.82 01 N > S* 
Human Abstract 3.42 44 3.69 3.40 9 1.40 n.s S >N 


* N.: Normal. S: Schizophrenic 


METHOD 


Although not a study in information theory per se, 
the approach used here to the measurement of problem 
solving behavior was suggested by the work of Miller 
(2, 3) and others (1, 4, 5). The task involved the presen 
tation of 32 pictures from which S was required to 
choose, on the basis of information supplied by Z, the 
correct one. One set of pictures was heavily weighted with 
human content; the other contained only pictures 
devoid of human content 

Subjects. Fifty patients from the Warren State 
Hospital, Warren, Pennsylvania, and 50 students from 
Meadville High School, Meadville, Pennsylvania, 
served as Ss. Each patient had been diagnosed Schizo 
phrenic Reaction and was judged by the hospital staff to 
be in good contact. No history of psychiatric disorder 
was present in the student population. Each patient 
was equated with a student on the basis of either IQ or 
grade achievement.? 

Procedure. In both the human and nonhuman prob 
lems the pictures were selected so that when five bits 
of information had been given to S, he could locate any 
given picture out of the 32 arranged before him. At the 
outset of the study, standardized instructions explain 
ing the test were read to Ss, and they were encouraged 
to ask questions. All questions were answered fully and, 
if necessary, the directions were repeated. During the 
problem, Ss were not told the extent to which they 
were correct or incorrect. Both groups were encouraged 
to listen closely and to do their best 

In the problem with human content, half of the 
pictures contained a male and half contained a female 
rhe 16 male pictures were equally subdivided in the 
following manner: tie, no tie; book-in-hand, no book-in 
hand; glasses, no glasses; and hat, no hat. The 16 
female pictures were equally subdivided in the following 
manner: scarf, no scarf; book, no book; glasses, no 
glasses; and coat, no coat. When S was presented with 
the 32 cards containing human stimuli, he was given 
five bits of information, one bit at a time. Following the 
presentation of each bit, S was asked to indicate which 
of the 32 cards it might be. Thus, if the first bit was 
the statement,“It’s a male,” and if S selected a card 
with a male figure, he was credited with one bit. He 
might then be told that “the person for whom we are 
looking is wearing a tie.” If S correctly indicated a 


male with a tie, he was credited with a second correct 


* No attempt was made to equate for age since pre 
vious exploratory studies by the authors indicated 
Pearsonian rs of very low order between performance 


on the task and age 


bit. Then he might be informed that “the person for 
whom we are looking is carrying a book.” Proper 
identification of a male with a tie and a book earned an 
additional bit of credit. This procedure was continued 
through five bits, and each S was given a score from 
zero to five, depending upon the number of bits of 
information about the cards which he could effectively 
utilize. The initial bit of information was alternately 
male or female. All Ss were given two trials with 
cards containing human content and two trials with 
cards containing nonhuman content 

The cards with nonhuman content were designed to 
parallel the human content cards and the problem was 
presented in the same manner. These cards, drawn with 
a Leroy Lettering Set, were divided into the following 
five bits: (a) printed in blue or red; (6) letters or non- 
letters; (c) if letters, big or little, and if nonletters, then 
numbers or figures; (d) printed on the top or bottom of 
the card; and (¢) encircled or boxed in. Before present 
ing these cards to Ss the difference between letters and 
nonletters was explained to them. Numbers and 
figures were pointed out and the general differences 
noted 


RESULTS AND DISCUSSION 


For the purpose of comparing the two popula- 
tions, each schizophrenic S was matched with a 
normal S on the basis of his performance on the 
nonhuman task. These Ss were then compared by 
¢ tests for matched groups for performance on the 
problem involving human stimuli. When the 
groups were compared for performance on the 
nonhuman problem, a similar technique was used 
except for matching on the basis of performance 
on the task involving human stimuli. An inspection 
of Table 1 indicates that when Ss were matched 
on the basis of their performance on the nonhuman 
problems, a significant difference (p = O1) was 
found between schizophrenic and normal Ss in their 
ability to deal with the human pictures. When 
comparisons were drawn between the same two 
groups, matched for performance on human pi 
tures, no significant difference existed between 
them in their ability to solve problems with non 
human content 

It is apparent that the observed difference be 
tween normals and schizophrenics could be a 
function of either the stimulus class or the level of 
task difficulty. It is conceivable, for example, that 








12% Rospert H. Davis AND 
the ferio periormance ol chiz phr cs on the 
hu in) tas) Na it tunctio of the increased 
difficulty of. the in task dependent of the 
timulus Cla There are two « derations which 
indicate t t the crucial ifiabie wa timulus 
Cla ather than level } probiem dithculty. On 
the one | 1, comparison ol the perlormance otf 
the orma ly « the two classes of stimuli 

licate y hceant difference with re pect to 
mea core | 7 (nonh i ind 3.78 (human), 
and 3.43 (| t ind 3.40 iman). Matched 
t computed tor botl ets ol mean were not 
wnificant (1.08 and .1 respectivel lf the two 
tasks differed in level of difficult e would expect 
this difference to be apparent in the performance 
of normal them. Norma might not show 
uch a diflerence, however, if both the abstract and 
human task vere SO Ca that asymptotic scores 
were obtained both \ pection { Table 1 
indicates that normal did not obtain score ) 
near the upper limit ()}) as te allow the conclusio 
that the task e except ea for either 
Ki ip | , 

These re ts appear t ite that the prob 
len ol Za t ol ct yp re ( a tunctio 
of the cla i stimul h which they must deal 
ind not ne i 1 deficiency in problem-solving 
ability per se | ich classe were employed n 
the present study. One of these classes involved 
huma timul ind the other nonhuman stimuli 
Had the bee compared using other classe ol 
timu ‘ exual nbols, mother figures, ag 
gressive scene different racial groups, et the 
results might well have been different. In other 
word i tec! jus imilar to the one used in thi 
tudy ! mt p ce 1 method for olating 
classe f stimuli which consistently and differen 


also calcu 


* Test-retest reliability coefficients were 
lated for these tasks. With respect to the human 
stimuli, the coeflicients were .72 for normals and .66 


for schizophrenic 
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tially disrupt or impair the problem-solving 


ability of psychologically distinct groups of people, 
judiced and non 
Such 


e.g., normal and psychotic 
pre judiced ; extroverted and i 
used to test various 


could also provide 


pre 
troverted, etc 
information could be theories 
of personality and important 


data to the practicing clinician 


SUMMARY 


normal and sch 


Ss to 


involving human and non 


When 5S 


The hrenik 
deal 


human 


ability of 


with information 


rT 
IZOp 


stimuli was compared were 


matched on the basis of ability to utilize informa 
tion about nonhuman stimuli, it was found that 
the problem-solving behavior of schizophrenics 
wa ignificantly disrupted by the presence of 
human stimuli in the test situation. When the same 
populations wert matched for ability to solve 


problems involving human stimuli, no significant 


difference was found in their capacity to deal with 


information involving nonhuman stimuli. The im 
portance of this unilateral disruption of problem- 
solving behavior for personality theory was pointed 
that the 


could be 


out, and it was suggested technique 


utilized in the present tudy used to ex 


plore the effect of numerous classes of stimuli on 


normal and scl izophre nic behavior 
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d | scale measure oO! a ithori How IS thi to be reversed? (one could Say, The 
ity trend 1) all of the iten sex life that goes on today even in place ; where one 
wile direction, such that the least expects it, 1S nothing compared to the wild 
authoritarian tends to agree with all item sex life of the Greeks and Roman Chus reversed, 
the onauthoritaria to disagree Cronbach the item would still seem to appeal to the prurient 
+. 5) has marshalled considerable evidence to moralizing that the initial item sought to tap 
cate that total scores on tests constructed 1D Better perhaps to state it, ‘Neither was the sex 
ul la ( ( tain ome tematic wiance life { the Greek and Roma _ nor is the ex lite 
adhering tt the content of the iten but ol today a wild and immoral as } commonly 
rather t i esp r et ol acquiescence thought * In the end wt decided to leave th item 
ependent of specific item content. Consistent alone 
dividual difference na rey itedly bee 10 d In the first study we employed ten item in they 
the tendency t igre th iter general i d ( ginal form” a d ten reversed item Because 
{ t controlled test construction, t uC these reversals are an ¢ ential part of the evidence, 
quiescence bias ca y erat able reliabilitu tne are present 1 in full as the first 10 items in 
witl tests and « elatio between tests that Table 1. which also indicates the test form and 
ure ep ent the react ol pers to tne iter iumber of the original item in The Autheri 
pec fic content tne ter While is Berg 4 larian Personality 1) from which each is mod 
gyest i niescence may in it elf vield a usel il fied. It ( e analyse these two set i item ire 
pe slity measure, it often leads to a misint employed as one 20-item test, but mot frequently 
pretat ol finding v rie col tent a lual the ire treated a eparat 10-iter tes! vith 
differences are attributed to item content rather the original wording being de ited as F +-, the 
than t tl ore eneral trait The tandard reversa is I All iter have been scored in the 
precauti iva t ich respx e bia 1 attitude suthoritarian directio 
re earcl t vrite tems rded in both rec In the first tudy, these 20 items were included in 
t that the ideal high and low scorers must a longer questionnaire and administered to 134 
agree with hall the items and disagree with the tudents (69 male, 65 female) from the introduc 
ther The pre ent paper re wrts on two efiorts t tor course n p chology at Northwestern 
sO I the F scale by reversing the wording of | University, fall, 1953 Five levels of response were 
ome F scale iter flered: strongly agree, agree, undecided, disagree, 
| I t attitude cal there i peciha le trong! disagree but tor coring purpose the 
social object pre ted, al r th reactior te t resp were dichotor red aS agree and disagree, 
and it a mpi itter to pre ent both favorable the decided re pons on each item being in 
and unfavorable ite With the F scale the prob cluded th the le frequently chosen of the two 
ler much more difficult, and the present ter [he F+ scale had a reliability‘ of .69, the I 
can make o dele n { the partic ilar reversa cale a reliability ol 1 The correlation between 
used. Take for example the item The wild sex the NaS positive a d in the proper direction but 
life { the 1 Greek and Roma vas tame Va 17 gnificant at but the .05 level. To 
compared to some of the goings-on 11 this countr explore this failure in item revert al at the item-by 
even place where peopl mignt lea t expect t tem level, tetrachori« correlatior vere computed 
n a ' ii a sate lee tts atl between the I tems and the F + total score 
ll study was upportea 1 py y in l a ’ 
States Air Force ee cael No. AF 18/(600)-170 These value we hown in Table 1. Note that 
monitored by the Crew Research Laboratory, Air Force the highest vatue w) he wrong direction 
Personne] and Training Research Center, Kandolph licating that the high-F+ person tends to 
Air Force Base, Randolph Field, Texa Permission is agree with the item. As checked by x? th if 
grante 1 for reproductior transiatior publication t p ignihicant at the 03 level. Wherea the 
use and disposal in whole and in part | w for the 
United State Government * These were items 1, 13, 19, 26, 31, 37, 38, 42 of 
?This manuscript was written while the senior Form 40, and 55 and 67 of Form 78 
suthor was a post-doctoral resear h fellow of the ‘All reliabilities were computed by an internal 
0 ten ethod of Gulliksen (6 


Nationa! Institute of Mental Healt! 
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TABLE 1 
Ine F Irems AND THerr CORRELATIONS WITH SCORE ON THE F+ SCALE 
* 
F+ Version ‘ First Second 
I Version , 4 
ae Seam Testing Testing 
40 9 When a person has a problem or worry, he should try to think it 30 
through, rather than try to forget by keeping busy with cheer 
ful things 
My 2 It takes more than will power to conquer most obstacles 11 
“wo 44 It interferes with learning and effective work when our teachers or -.10 
bosses outline in detail what is to be done and exactly how to go 
about it 
Hi) 27 It would be better if people admitted to themsel ves that they do 12 07 
not always feel a great love, gratitude, and respect for their 
pa ents 
) 6 In spite of what some people say, human nature does not make war 18 06 
and conflict inevitable 
HM) 21 It would be a good thing if more young people kept their youthful, 18 12 
rebellious ideas as they grew older 
0) 23 Strong leaders in whom people put complete faith are not what our 09 12 


country needs today 
i) $4 Geeting rid of all of the immoral, feebleminded and crooked 28 06 
people would not actually solve many of the country’s prob 


le ms 
1 41 rhe artist, writer and philosopher are probably more important to 28 10 
society than the business man and manufacturer 
40 45 Familiarity breeds respect more often than it breeds contempt 15 04 
0) 2 With most men it is luck and help from others rather than will 16 
power and drive that determines what eminence they attain in 
life 
Ww) 8 It is unnecessary and undesirable for a person to have complete 06 


faith in some supernatural power whose decisions he obeys 


wi hout question 


1 12 Even though a person has bad manners, habits, and breeding, he 02 
should still be able to make himself acceptable to other people 
to 16 It is completely absurd to believe that some people may be born 03 


with an urge to jump from high places 

if 18 Even though nowadays many different kinds of people move i6 
around and mix together a great deal, it is foolish to worry 
much about catching an infection or disease from them 

A 44 It interferes with learning and effective work when our teachers and 20 
bosses assume too active a leadership and tell us exactly what 
tod 

78 24 There have been other ages with more instability, conflict, and 07 
imminent danger than ours 

i* $§ It is a good thing to maintain the present emphasis in colleges on 03 
invellectual and theoretical topics, rather than change the em 


phasis to pr ictical matters 
* Por original items, see (1 
original wording the high-F person was anti-intro 16 F+* and IS F items were employed. The F 


ceptive (When a person has a problem or worry, it items included the 7 of the first set which showed 
or him not to think about it, but to keep positive correlations, plus 8 new reversal attempts 


is Dest I n 

bu with more cheerful things), by agreeing with These items are also shown in Table 1. These 31 
the reversed wording (first item, Table 1) he items along with others were included in a ques 
appeil troceptive. What is in common between tionnaire administered to 144 male students from 
the two wordings is that both are plausible de introductory psychology courses at Northwestern 
clarative statements. Items 34 and 41 dealing with University in the fall of 1954. The reliability of the 
getting rid of the immoral and feebleminded, and F + items was .53. The reliability of the F items 
the value of artists vs. businessmen, seem to have was .41. The correlation between the two scales 
been more successft reversed, and are now in a was 01. Again, tetrachoric correlations between 
form in which the high-F person tends to disagree 

with then * These were the 10 items used in the first study, 


In the second effort at reversing F scale items, plus Nos. 4, 9, 25, 33, 35, and 39 of Form 40. 











RESPONSE SE1 
the F + total and the | items were computed 
As can be seen from Tabk 1, the values are low 


and predominately negative.* Although we have 


explored but a small sample of the ways in which 
thes« and other F s« ale items could be re versed, it 
seems tair to conclude that the achievement of a 
uccessful reversal would be a | 

It was possible in the second study to obtain a 


rather pure measure of the acquiescence responst 


set through a special scoring of other questionnaire 
items administered at the same time. These items 


were intended to measure (a superior orientation 


or a tendency to identify with one’s superiors 
rather than with one’s subordinates in situations 
involving cross pressures from these echelons: 
(b) alienation, or a tendency toward cynicism about 


the sincerity of other persons and conventional 


values; (c) identification with disciplining author 
ity; and (d) preterence for cooperation and doing 
things in groups. For each of these topic an 
equal number of positive and negative items were 
constructed, to balance out the acquiescence r 

sponse set Rescoring the item for a general 
tendency to agree should thus produce a measure 
of the response set in which content factors were 
balanced out. Based on 74 items, this measure 
showed a reliability of .63. It correlated .43 with 


F +, 06 with F —, and .32 with total I 


DISCUSSION 


If the F scale were free of re ponse-set variance, 
one would expect F+ and F to correlate pos! 
tively, to a degree approaching their reliabilities 
On the other hand, if the systematic variance 
picked up by the F scale scores were only acquies 
cence response s¢ 4 then F 4 and I should show 
a high negative correlation. The zero or low 
positive correlation found indicates that neither 
extreme is true, and that the typical F scale 
score probably contains both response-set variance 


and variance adhering to the content of the items 


But without rejecting this conclusion we can 


note that it does not suffice for the data pre ented 


On this basis taken simply, F hould correlate a 
highly (th yugh negat vely with the pure response 


® The difference between the two studies in levels of 
correlation found may be due to the fact that the second 
study was based on males alone. In general, in our 
experience in this area, correlations and reliabilities run 
higher for women than for men 
In a further exploration, these items were split into 
40 worded in the first person (“I feel,” etc.) and 34 
worded as declarative statements, as in the F scale 
The acquiescence score for these had reliabilities of .46 
and .47 respectively, correlated with each other .45, 
and correlated with F+ .41 and .33. The difference in 
F+- correlations was not significant by the Pearson 


and Filon test (7), and was not in the expected direction 
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set measure as does F+-. It has about t! ame 
number of items, and all items are worded yr 
direction. But th outcome does not appear. It 
eems necessary to consider that the acquiescence 
response set Variance iS correlated po tively 
with the item-content variance of the I cale 
rhe trait of agreeing with items seem to be 


symptomatic of a more general personality 
syndrome also symptomized by belief in the F 
scale content measured independently of | the 
response set. Our best evidence of this is the highly 


) 


significant correlation of .32 between the pur 


response-set measure and Total F in which re 
sponse-set variance is presumably canceled out by 
the combination of items worded in both directions 


On this basis, we would interpret the correlation of 


43 between the response-set measure and F + as 


due to the summation of the effects of response 


set in the F + items plus content in the F + item 


lhe I items are so worded that these two source 
of variance each potentially correlated with the 
pure set measure, subtract from each other and 
cancel each other out, resulting in the 06 


value Phi interpretation provides in addition 
rationale for the lower reliabilities of the | 
»¢ ales 

rhis interpretation by no means implies that the 
F scale will correlate only with test scores con 
taining the re ponse set Ample evidence to the 
contrary is available, in which the F scale corre 
lates significantly, if not highly, with measures of 
quite different construction (3). However, it doe: 
imply that the correlations with test imilarly 
constructed will be higher than they would be 
were the response-set bias controlled as by having 
items worded in both directions, by multiple 
choice format, or otherwise. This observation ha 
important implications for some of the absolute 
levels of correlation reported in the initial study 
(1). Like the I cale, all of the Ethnocentrism 
iten were worded in the same directix thus the 
acquiescence response set must raise the correlation 
above what it would otherwise be. For the Political 
and Economic Conservatism scale, on the other 
hand, items worded in both the conservative and 


the liberal direction were employed In thi 


fashion, as well as in terms of the smaller number 
of items used, the dice were inadvertently loaded 
against finding political-economic attitude a 


important a correlate of ethnocentrism as the F 


cale measure of per onality 


* Since this paper wa prepared two other | cal 
item-reversal studies have appeared: Leavitt, H. J., 
Hax, H., and Roche, J. H. “‘Authoritarianisr and 
agreement with things authoritative. J. Psychol., 19 
40, 215-221, and Bass, B. M. Authoritarianism or 
acquiescence, this Journal, 1955, 61, 616-62 Both 
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UMMARY 

\ atte pt was ma to eva ite acquiescence 
resp e-set sriance } the |} cale measure olf 
authoritarian personalit tre by employing 
eve ed ordings ot ite Re ponse et 1s found 
to be an important factor in the F scale whicl 
cann casi be corrected by the practice u ual 


truction ol both 


( mpioying 


positive and negatively worded items. Evidence 
een dicate in addition that the content com 
ponent ol the F scal correlated with the acquies 
cem esp ‘ “et 

‘ ur g evidence Ol response el variance in 
the ordinar form of the | Cait Ka conlirms the 


lower | rehability ound here Leavitt doe now re 
port rehabilitu Neither study provides the correlations 
ol both k-+ and I with an external measu ol re 
ponse set which provides the major interpretive argu 
me t the resent paper 
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oTu the California E scale and F scale (1) 
B ire ¢ posed entirely of positively stated 
iter Ihe items are so worded that agreement 
wit the indicat ethnocentric’ or ‘authori 
tarian’’ attitudes while disagreement presumably 
indicate Live equalilarian” end ol a umed uni 
cumensional scale Recent inv tigations (2, 7, 8) 


1 the importance ol such a positive 


ten using the F scale as a measure ol 
attit Lhese studies assessed the relationships 
between I -scale iten in their original po itively 
tated form and f-scale it rewritten in ‘“‘re 


VETSE’ form so that disagreement rather than 
iret t could be considered an indication ol 
authoritarian attitude he correlations between 
apre ent to reversed and original items were 
yenerall found t be positive ( people who 
ree vilh positive tated | cak tel i1SO 
tenae ire Vilth the eve ed item even 
thouwh the item « tent wa PI t¢ 1 direction 
1} e studi cate that ren the accepted 
te practice ol ¢ interbalane the direction ol 
te < t /) 18 used 1 pec in authori 

I is con eted W eu sutl ere 
{ Pil I ‘ ui Clue il Research Fellows of the 
Nati il | il Mental Health. Dr. Jackso 
now a I re va i ite | ve t t L) 
Me it the | i il Test ‘ ea 
Princeton | ‘ t 


tarian scale, a paradoxical property of negative 
reliability is obtained 

This evidence obtained with the F scale may 
also be interpreted in terms of what Cronbach 


(4) has called an acquiescent response set, which 


has implications for interpreting the reported 
high correlations between the F scale and the E 
scale Chere is a possibility that the observed 


covariation Ol responses 
attributable t 


to the two scales may be 
in part a consistent generalized 
tendency to agree or acquiesce to items in print, 
rather than being soiely a function of consistent 
ideological beliefs 


In the 
Students 


present study, 93 junior and senior 


in high school, 40 males and 53 females 
ranging in age from 16 to 18, were given the 9-item 
i, scale, Forms 40 and 45 (1), the 
and the Gough intolerance 
clected 


reversed F scale,’ 
Scaie (0 The groups 


were from social studies classes and con 


tained only hristian student 


Lhe 


writing each 


white, ¢ 


reversed F scale was constructed by re- 


item so that the content would ap 


pear to re tlect a vie wpoint opposed to the or ginal, 
while For 


retaining a Similar Style ol expression 


indebted to 


4 The Mr. E. B. Weaver, 


Principal, Topeka High School, for kindly consenting 


writers are 


to cooperate in this project 
* The authors wish to thank Dr. Irwin Miller and 


Jame 5 bk 


Simpson for help in reversing the items. 











‘ETHNO( I 


1. A love of freedom and complete indepen¢ 

2. All the will power in the world will not h 

5] [he success of modern science proves tr 
man mind 


4. Because human nature is improving, war 


5. Every person should have complete faith 


whose decisions he obeys without que 

6. When a person has a problem or worry, | 
appcars 

7. The deeper and more enduring qualitic 
ternal things like manners, habits, an 

8. What the youth needs most is complete 


tream Ol Cooperative progress lor the 


NTRISM’’ AND ACQUIESCENCI 133 


rABLE 1 


Tue Reversep F Scat 








lence are the most important virtues children should learn 
elp us when weaknesses and ailhculties stand in our way 
lat every important thing can ultimately be understood by the hu 


and conflict will eventually be eliminated 
in his own inde pendent judgment, not in some supernatural power 
slion 


re should drop ever ything and concentrate upon it until the solution 


in getting along well with people are far more important than ex 
1 breeding 
freedom from discipline in order to flow naturally into the onward 


beneht Of ali Mankind 


Y. It is known with complete certainty that the urge to jump from high places is learned, not inborn 


10. Nowadays, since democracy demands th 


a person should not be finicky about ¢ 


at people of widely different background and station mix together, 
atching a disease from any of them 


11. An insult to our honor should always be overlooked, for ‘“‘whosoever shall smite thee on thy right cheek, turn 


to him the other also 


12. The rebellious ideas that young 


ante lature citizenship in adulthood 


people sometimes get must be encouraged and de veloped at all costs to guar 


15. Only if prewar authorities are kept out of the German government, will true Gemocracy be a hieved in that 


count 


4. What 


this country needs most, more thal 


t iclively share responsibilty 
15 ex ollenses, such as rape and attacks on 
treated with kindness and sympathy | 
16. All attempts to divide people into the tw 
17. Every truly mature person outgrows chi 
tude lor his parents 


I 
18 Such pseudosciences as astrology bave ne 
19. The surest way t 


20. In this scientitic age there can be no ju 


insure the disappearatr 


sonal and private matters 


1 Willlul leaders, are laws and political programs requiring all citizens 


children, never merit punishment; such offenders should always 
qualihed psychiatrist 


o distinct classes of the weak and the strong are doomed to failure 


ldish teelings olf submissive respect and of excessive love and grati 


ver explained anything and never will 
ce of the true American way of life is to resort to the use of force 
tification for denying investigators the right to study so-called per 


21. The complete end to wars and social problems will soon be realized 

22 There would be no immoral, crooked, or feel eminded people il we could get down to brass tacks and some 
how get rid olf our social prot ems 

23. The wild sex life of the old Greeks and Romans makes sexual goings-on in this country seem tame 

4 If pe ywple talked things over and didn’t work oO mu h, everybody would be better oll 

2 It is foolish and ridiculous to have ideas that our lives could possibly be controlled by plots hatched in secret 
place 

46. Ho exuals are never criminals and must not be punished as such 

27. The business man and the manufacturer are not any more important to society than the artist and the pro 
Ie 0 

28. No sane, normal, decent person can avoid having thoughts of hurting close friends or relatives 

29. We are ound to admire and respect a person il we get to know him well 


30. Nobody ever learned anything really imy 


yortant through suffering 


example, ‘‘Obedience and respect for authority are Ihe Gough intolerance scale is a group of items 
the most important virtues children should learn empirically derived from the MMPI using the 
was modified to read “‘A love of freedom and cor differentiation of extreme groups on the Levinson 
plete pendence are t ‘ portant virtue sanford Anti-Semitism scale as a criterion. The 
childre hould lear " No weakn r di intolerance scale has been shown to correlate 
culty ca j ick if we have enough will ignificantly with the California F scale, .46 and 
power is cha ed t \ the w powel 13 (6 Incidentally it hould be ioted with 
the wor | t help u I ecaknesst and respect to resp e sets that 29 of the 32 items on 
aifthiculti tand ] i' et ee Lable 1 Cx ry cale ire cored true to be the 
here | i alte ative ite that ild intol t”’ direction. Gough (6) also reports a 
etter epresé t the eve ri the il ligt levative correlatior 73 between the 
tatements. However, a defense of the adequacy tolerance scale and the MMPI A scale, a meas 
f the reversa ised the present stud ild ure ¢ test-tak ittitude that contains 29 of 
eem largel rel ant 1 i ¢ la typol LiKe 0 item cored saise 

authoritaria to he le ed | ter I Conn 5 ha round a gnificant correlation 
beliet i parti ilar cit ricai ¢ tent and between the | cale and a tendency to answer 
presuma disagreement th its reverse true” to a diverse group of MMPI items and has 








JAmEs 


rested that the correiat i lué 4 
generalized t en to ag wit tatement 
ce the preponae ince { it ‘ red true 
tl tolerance Cal iygvest that it too i 
licate ome generalized tendenc t LCCUIESCE 
eT ght expect cl i respons tendency to be 
reflected 1p tive correlat between agree 
ment to itolerance-scale ind reversed | cale 


er eve though the a positive correlation 
betwee the tolerance scale and the original | 
cale. OW cor ‘ 1 positive ¢ rrelation would be 
expected bet en the I cale and Gough ntoler 
ance cal ecause of their commo empirical 
ancestr 

The result licate that even though the Gough 
tolerance scale correlated posit vely with the I 
cal r 4% 2) in our ample it also 
correlated posuive with agreement to reversed 


F-escale iten r 4 Phu ibiects who ayreed 


th the intolerance-scale items tended to agree 
witt eversed | cal tem However, the tact 
that the tolerance cale appear to reflect a 
wcquiescent re jx ‘ et mav even contribute to 
t CO truct valiadit { but extreme caution 


lefining the characteristics 


‘ resp. lents on the ba ol item ¢ tent. Also 
the correlation between the EF scale and the re 
ersed | cale wa 04, rather than being gnin 


gative as would be expected Irom item 
highlight the 


study ng certain 


necessity otf 
attitude 


ntent These result 
co trolling item torm in 
authori 


uch a ethnocentrisn or 
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tarianism.”’ Also 


between bel 


pre viously re port d re lationships 
take a differ 
measures 


efs and personality may 


ent form when more adequate attitude 
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rIME AS A FUNCTION OF ANXIETY 


FREEMAN anp HOWARD MAHER 


lowa State College 


“RICH 5) has recentiy reported a tunct mal 

J elat hip between intelligence and d 
crimination react time rccountil I esult 
therwise attr itabie to the peratio | anxiet 

a drive triable. It appears that the rati ale 


ip assert that the l rimination 
reaction time test r A t be correlated with 
ntellectual tactor } I} tatement doe 
not appear to be ¢ npletel te | ith the 
ict is reported (srice elerence ; ind else 
where 1) 

On examinat of Melt data (4, pp. 102, 
1030), one finds that discrir ition react time 
(DRI onl lerately relate t in tellec 
tual” factor Further bevond the first t al ! 


DRI 


periormance, 


ally disappear. On the other hand ich matters as 
perceptual speed and simple react time assume 
omewhat greater importance as_ performance 
progresst ] Con equer tl any test olving 
perceptual factors may be expec to predict a 
percept ial criterior DRT perfor ince reason 
ably well. Grice reports that the Cl il Aptitude 
Fest, which he ed 1 ‘ge ntellectual 


“perceptual factor 


others. Guilford (2) indicates that 


among 
perceptual 
actor correlate about as well as a others in 
predicting DRT performance 

In short, it 


trated that 


entially 


with at 


would seem that Grice es 


ap ychometric devic« 


veighting ol perceptual factors does 
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reasonably well in predicting a type of performance 


that is largely perceptual in character! One may 
therelore question whether a functional relation 
hip between intelligence and DRT 


iably 


has been re 
indicated and whether such a relationship is 
substantively likely. Finally, one may question the 


use of the Clerical Aptit ide Test as the basis for 


equating groups for intelligence. Were not Air 
Force data on tests more universally recognized 


as general intellectual measures available 
It is 
tr 


the present note should have additional significance 


believed that some of the points raised in 


n view of the increasing use now being made of 
the 


interpretation of experimental samples of behavior 


psychometric measures in prediction and 


[here is implied in such an approach a joint 


responsibility among several research areas 


SOCIAL STATUS AND ABSTRACT 


DONALD C. FINDLAY' 


Human Research Unit No. 1 
Fort Knox, Kentucky 


N recent years “intelligence tests” have been 
I sharply criticized for an alleged cultural bias 
It has been argued that this bias handicaps the 
bright child who has grown up in a lower socio 
intelli 
gence test, according to the argument, places a 


economic environment. The present-day 


premium on knowledge readily accessible to the 
child in a middle-class home but seldom accessible 
to the child in a lower-class home 

Thus, when a lower-class child and a middle 


class child are found to have similar IQs, it is 


possible that the lower-class child is actually 
“genetically” brighter, and that his performance 
was handicapped by the cultural bias of the test 
(3, p. 4). Presumably, if the two children were 
tested with intellectual problems involving ma 
terials with which they were equally familiar 


the lower-class 
make a h gher score 

rhe present study is an attempt to evaluate this 
ng children of similar IQ 


backgrou 


(2, p. 66), child would probably 


assumption by compari 


but of dissimilar socioeconomk dona 


series of sorting problems requiring abstract 


behavior The problem appeared to involve a 


knowledge of only four relatively mpl block 

characteristics: height, width, shape, and color 
rhe hypothes tested wa If lower-statu 

children and middle-status children of similar 


This stuay was adapted from a doctoral disserta 
mitted to The [ 


senior author 


tion sul niversity of Texas by the 


AND 


AND 


~ 
oA 
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BEHAVIOR 


CARSON McGUIRE 
University of Texas 
IQ are given block-sorting problems involving 


concepts of equal familiarity to the two groups, 
the periormance ol the lower status children will 
that of the middle 


be significantly superior to 


status children. 
METHOD 


Ninety-six subjects, selected by age-grade, 
Ist 


Subjects 


index of status characteristics and intelligence 


quotient, were used. Thirty-two Ss with IQs ranging 
from 90 to 110 were selected nonsystematically from 
each of three grades (the third, sixth, and ninth) in the 
public school system of a central Texas community 
Half of the Ss at each grade level were selected from 
family backgrounds which rated as lower status (51 
yw less) by ISC standards. The other half of the Ss at 
each grade level were selected from middle-status 


higher than 51 kground 


Age-gracde 


family bac 


refers to children of the same age at a 


given school-grade level. The index of status charac 


teristics is the social class index developed by Warner 
and his associates (5 A modified form of the ISC based 
on occupation, dwelling area, and source of income was 
used (4 IQ was determined by means of the California 
Test of Mental Maturity. Third-grade were tested 
on the Primary Short-Form, sixth-grade Ss on the 
Elementary Short-Form, and ninth-grade Ss on the 
Intermediate Short-Form. Differences between mean 
IQs of the two status groups at each age-grade were 
not significant ((’s of .11, .18, .13 

Materials. Four block-sorting problems similar to 
those of Ach (1 were used. In each probien the 
task was to sort 20 blocks into four five-block grouy 

he four problems were derived from a matrix of 





DONALD C, FINDLAY AND CARSON McGutr! 


rABLE 1 


ARIZED ANALYSIS OF VARIANCE OF PERFORMANCE 
on ORTING PROBLEM 


had 


two cha i 


ng the concey 


In order t ermine the gnifiicance ol the per 


formance di es on Phase a, block 
pha 2. betwee! middle and 


analysis-of-variance procedurt 


sorting 
i¢ Status 
vere used. Com 
parisons were ni ade b age-gi ace If] VI, and IX, 
and by the four ord 


ol problem presentation 
lable 1). The mean difference between middle 
tatus Ss (12 15.2) was 
ignificant The decrements in 

an score grade III, 13.1 for 
age grade Vi and 11.2 tor age-gr ace IX were [tour d 


to be significant beyond the .001 level. Differences 
between order of presentation were 
r were vy of the 


Phase t the 


t 


interact 
experiment the chi-square test 
used to compare difference 


iddle-status 


between low 
mple c ncepts 
f the differ 
response 

ed) for each 


juares with 
H 


Status gre up 


the exper! 
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g ma middle-status Ss 


iperior to lower-status S 
by the method 


terials, yet 
( yenerali 
se findings is limited 


sampling election) 


well as by the tact th: nly one 


n the 
obtaining , 2s 
test 


used in 

intelligence 

was represented 
rhe 


the be 


incidentally, tk support 


t el in abstract 


vaye 
1 age and education through 
adolescence It 

resulted from 
the primary, 
intel 


ind ninth 


two st 


behavior in 
( hildh« 


however 


early possible, 
finding ucce 


content in 
rms ol 


this 


sively mo! 
ntermediate fc the 


, 
eleme! 
the third, 


sixth, 


the itus 


OUMMARY 


~ intellige 


yn that 


! 


ower-cia s childre n 


enavior) 
four rela 


imple concepts found to be equally familiar 
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The Menninger I 
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that 
than 


hypothesized 
brighter 


groups. It was 


subjects are 
and that their performance on 


to the two 


actually 


lower-status 
their [Qs indicate, 
the block-sorting problems would be superior to 


Lhe hy 


that of middle-status Ss of similar IQ 
Ss 


confirmed; middle-status 
to lower-status Ss 


pothesis was not 


were ignificantly uperior 
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Cronbach (6, 7) ha repeatedly called attention 
under the lahe f “response set 

Cohn (4 oting that all the items in the Cali 
fornia scale are lirectional, found a significant 


correlation between the F scale and a tendency to 


answer ‘“‘true to a diverse group ol MMPI items 
Jackson and Millet Jackson (12 tass (2), and 
Leavitt, Hax, and Roche 14 ndependently 
modified the direction of I cale item © that dis 


agreement rather than agreement would reflect the 
presumed authoritarian ideology. These studies 


elded generally posuwe correlatio! between 


its with the original and reversed iten 


ayreeme r 
such correlator do not indicate much about 
suthoritarian ideolog but they do tell us some 
tl r about the te! themsel ve na uyye t the 
1s ta concept like negative reliability 
If tl yeneralized tenden« to agree with iten 
pendent of their content were adequately con 


tr ed many I the relation } p prev ously ob 
tained with the I cale might be different 13 
The present tud in attempt to interpret the 
reported correlati between the Einstellung test 
ind the F scale as an indication that both measures 
are reflecting tendenci toward acquiescence, 
contormit and overgeneralization, rather than as 
i cation that suthoritarian hould be 
labeled rigid Reversed F-scale items are also 
expected to reflect tendencies toward conformity 


ind overgeneralizat ich that the reversed I 
cale hould correlate posit el th the Ejinstel 
lung test 
MeTHop 

yuhject 

Phe were 77 students from introductory psychol 
"We ind history classes in a midwestern municipal 
iniversity.* Their median age was 20, their mean age 
was 23.5 and the SD 7.6 year 
Materia 

The nstrument used were i] the Finstellung 
arithmetic problems, (6) California F scale, and (c) a 

ecia constructed reversed F scale 

The Einstellung arithmetic problem hese problems 
were identical with those used by Brown $ They 
consisted of two examples and seven item each of 
which involved withdrawing certain amounts of water 
from a large jar in order to obtain a desired quantity 
The second example and the two “‘set”’ problems could 
be solved by the formula B \ 2C, where B is the 
largest jar and C the smallest jar. Following these were 
five problems which could be solved either by the 
longer formula above or by the shorter formula A ( 
Brown noted that th procedure elicited lol g”’ solu 


Purdue University 1953 


‘Unpublished study 


Lise ed in (13 
‘ The authors are indebted to Dr. Bernard Spilka for 
making the F scale data av le and to Drs. Spilka 





itherford for 


and Homer k 


S. J. Messick, 
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tions more frequently than the one used by Rokeach 
19 
The original and reversed F scales. In addition to the 


Forms 40 and 45 (1 os 
reversed | which 
the direction of the 


4) item California F scale 


were administered a scal was 


constructed by reversing item 


content while retaining as much of the original style 


as possible. The reversed F-scale items are reported by 


Jackson and Messick (13 


Procedure 

Il'wo weeks prior to the administration of the Ein 
ipproximately 
About two 


stellung test and the reversed F scale 
half the Ss were given the original F 


weeks after taking the Einstellung test and the reversed 


scalc 


F scale, the original F scale was administered to most 


of the 


ince 


remaining half 

Brown observed that an ego-involving atmos 
phere was essential for obtaining correlations between 
“authoritarianism” and “rigidity,” a odification of 
McCle! 
conditions 
was introduced to the class as a psychologist who 


and’s procedure (16) was used to create achieve 


ment The experimenter, dressed in a dark 


suit 


administer some very important tests 


was going to 
were then read which suggested that the 


Instruction 


measures of intelligence and leadership 


tests were 
potential and that the Ss were being compared with 
students from a nearby state university. They were 


instructed to write their names on the test papers and 


then to proceed with the Einstellung test. After the 
forms were completed, they were given some informa 
tion about the true nature of the test in order to relieve 


that have arisen as a result of the 


achievement-oriented conditions 


any anxiety may 


RESULT 


Of the 77 Ss to whom the Einstellung test was 


in arith 
the 


ten made one or more errors 


eliminated 


administered, 


metic and were therefore from 
sample. Of the 67 Ss remaining, 13 who failed t 
or the critical solution consist 
This who 


long solution and 22 who always 


use either the et 
leit 32 Ss 


ently were also eliminated 2 


always used the 
used the short solutior 
classification 


these two | 


Iwo single analyses of variance 


were computed between instellung 


groups, one comparing them with respect to origi 


nal F-scale scores and the other with respect to 
reversed F-scale score Since directional hypothe 
ses were made, one-tail tests were used. Table 1 


indicates that both the I cale and the reversed 
F scale differentiate the Einstellung groups at the 
05 point. Thus, Ss who agree with F-scale items 


lung test 


tend to use the set solution on the Einst« 
However, 
! / 


same OS also tends 


with 


agreement reversed F-scale items 


for the to covary with the use 
Irwin Miller and James I 
For the 42 


was 


* We wish to thank Dr 
Simpson for help in constructing this scale 


Ss taking the original F scale, mean item score 


3.10; SD, 67. The 54 Ss 


received a mean item 


reversed scale 


taking the 
of 4.05; SD, 46 


score 
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TABLE 1 
DIFFERENCES BETWEEN EINSTELLUNG GROUPS 
WITH RESPECT TO ORIGINAL AND 
REVERSED F-ScALE SCORES 


I ale Scores Reversed F-Scale Scores 
& = 
23 _@ 
ource dj| Sei F p purce Yias F p 
+ . « 
aA oa 
Long” vs 1 |1227 |3.1 0 Long” 1 99 2.84 0 
rt”’ s 
ers short 
Solvers 
Withir 40 394 Within 52 | 190 


of the long Einstellung solution. Thus, by changing 


the direction of the item content, one may obtain a 
rigid’’ performance from people giving either 


“authoritari or “nonauthoritarian”’ responses 





he correlation between agreement with original 
and reversed F-scale items was +.35, rather than 
being significantly negative as consistent responses 
to content would require. The corrected split-half 
reliability of the reversed F scale was .77 The 


counterbalanced order of presentation ol the 


original and reversed F scales contributed no sig 
nificant effects. Also, the Einstellung results for 
both the F scale and reversed F scale were in the 


predicted lirection tor each of the two orders ol 


Disct ION 


rhe results empha ze the importance olf con 
trolling for response sets in constructing attitude 
sca It seems reasonable to assume that agree 
ment with a statement can be interpreted as re 
flecting belief in its content only if disagreement 
with the opposite content can also be obtained 


The tendency to agree with an item, amy item, es 


pecially when written ina cliché-like fashion may 
reflect important personality characteristics These 
n turn mav manifest themselves by increasing the 
correlational validity of the caie but not neces 
sarily the alidity as suggested by the content of 
the items (7, 11). When a respondent is confronted 
with a tem which contains a familiar phrase to 
which he has given little thought, agreement may 
represent merely a passive and acquiescing r¢ 
ponse. A preponderance of responses indicating 


iwreement to a large number of diverse items could 


thusr ilt either from belief in all of the statements 
wr trom an acquiescent response set. 5 ir 
ureement with an item may or may not reflect 
lisbelief its content; a person may be objecting 
merely to the style in which it is written 


Ch present stu ly emphas zes the importance 


‘ 


ot acquiescence in relationship between the I 
scale and the Einstellung test. It is important to 
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note, however, that Rokeach’s (19) original tind 
ings were with reference to the & scale and Einstel 
lung. Thus the conclusion that Rokeach’s results 
are primarily attributable to acquiescence or ré 
sponse set may not be warranted by the present 
evidence. It would be valuable to repeat Rokeach’s 
study substituting a different measure of anti 
minority attitude, such as a social distance scale, 
to determine if this also correlates with problem 
solving behavior 

We do not mean to imply that the search for 
personality and cognitive correlates of attitudinal 
clusters is fruitless or that relationships between 
rigidity and attitudes are inconceivable. The work 
of Stern, Stein, and Bloom (20) indicates that when 
each end of an authoritarian scale is defined by 
both positively and negatively-stated items, 
important relationships still appear, and in such 
a case it 1s possible to separate belief from re 
sponse set 

Ihus, if one end of an attitude continuum is 
defined by agreement with positively-stated item 
as is the case with the F scale, it does not follow 
that the opposite end of the continuum can be 
designated by merely tallying disagreements to the 
same items. It should also be noted that even if the 
direction of item content had been controlled in 
constructing the F scale, the mathematical model 
implicit in a Likert scoring procedure requires a 
homogeneous scale. As Green (10) has pointed out, 
the inte rcorrelations among the items are assumed 
to be due to a single common factor to which the 
items are all mutually related.” Since there is evi 
dence for multidimensionality in the ! cale 
(e.v., 5), a revision more basic than just controlling 
for acquiescent response sets is probably necessary 
before it can justifiably be used for measuring an 


attitudinal dimension (10, 17 


SUMMARY 


his study is an attempt to appraise the role of 
) agree with items inde 


be 


a generalized tendency t 
pendent of content in observed relationships 
tween “authoritarianism” and Einstellung ‘“‘ri 
gidity.”’ Subjects were given the Califérnia |] 
cale, a reversed F scale, and a series of Einstellung 
water-jar problems. Subjects who used the long 


rigid’’ solution in the Einstellung situation tended 


to agree with both F-scale and reversed F-scale 
tems These results suggested that both the 
Einstellung test and the F scale may be reflecting 
acquiescence and conformity. Also, the importance 


of defining both ends of a scale with positively 


tated unidimensional items before drawing con 
clusio! relating attitude and personal ty was 
er pl i ed 
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ANALITY, AGGRESSION, AND ACOUIESCENCE 


WILLIAM RABINOWITZ 


Bank Street College of Education 


| pecs 1) has recently presented evidence of a items. There is ample evidence that re ponse sets, 
r ish ‘ | 


elat hip betwee inalit ind = politica such a acquiescence, are a reliable ource of 
ayyre ) i hypothesized 1 | choanaly tic variance in test scores (2, 3), including tests of 
theor He found a significant correlation of .37 attitudes (1, 6), interests (5), and personality 
between two tests, the first a five-item measure ol characteristics (5). In view of this evidence, there 


and the other a four-item measure of is reason to believe that Farber’s results may stem 
yression. An examination of the items from acquiescent tendencies on the part of sub 


the two tests shows that the correlation may be a jects, rather than an inherent relation hip between 


function of what Cronbach has termed “response anality and aggression 

et’”’ (2, 3) rather than the result of a relationship [he most appropriate method of studying the 
betwee inality and aggression. On each of the anality-aggression relationship to employ tests 
inality items, the “Yes’’ option was keyed posi in which response sets cannot systematically 
tively, and on each of the political aggression items, influence scores. In the absence of such instru 
the Agree’ and “Strongly Agree” options were ments, an alternative procedure is to add to each 


keved posit vels The keying of the items | uch of Farber’s tests an equal number of items which 
that scores on the two tests are influenced in the measure anality and political aggression, but which 
same way by any general tendency on the part of are keyed so that the “No” and “Disagree” 
subjects to acquiesce, i.e., to say “Yes,” ““Agree,”’ options are positive. By examining the correlations 
or “Strongly Agree,”’ when responding to test among the original Farber tests and those newly 
n of Teacher Education, Muni constructed, it should be possible to determine 


' Now at the Divi 
cipal Colleges of New York whether the reported anality-aggression relation 
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hip is an artifact of the way in which the test 
items were keyed 
anality as summarized by 


Using the « mecept « 


Farber, several new test items which seemed to 
embody this trait were constructed. Unlike Far 
ber’s items, however, the items were stated so that 


‘ie om response would gnily greater anality 
The five items follow 


I usually lend my possessions to people who 

isk permission to use them Yes No 
If I’m doing something I enjoy, I often stay 

ip late at night, sometimes so late that - 

don’t get a full night’s sleep Ye No 
I’d rather lead a life of excitement and ad 


venture than one which is stable pre 


dictable, and well-organized Ye No 
Some peo] le think it’ 1 good idea to keep 

a budget so that purchases can be care 

fully planned, but in my case I don’t think 

I could stick to a budget for very long Yes No 
When I go to a restaurant I like to order 

foods which I have never tasted befor: Yes No 


Henceforth, Farber’s anality test will be referred 
to as the A, scale, the author’s anality test as the 
\ cale, and the combined 10 iten as the Aq 


Following Farber’ concept of political aggre 

msisting of four items was built to 
measure this attribute. Here too the items were so 
tated that a Disagree” re ponse vould signify 


greater aggression. The author’s four items follow 


There is good reason to believe that 


the iti f the world can live 
in peace Agree Disagree 
he foreign poli of the United 
tate hould be based on the a 
tion that ost other cour 
trie ure ir iri { I f yur 
enemiue Avree Di agret 
Capita pu hment I re 
il he A gree Disagree 
It healt! l a ce , i to 
re minorit lewpoint no 
matter | ‘ taste! the ma 
« Agree Disagree 
Hencefort! Farber’ test ol politic il iggre on 


vill be referred to as the P,; scale, the author’ 
te t of political aggre ion as the P, scale, and the 


ombined 8 items as the P+ cale 


2] Farber t each of the politica rre ion 
ite I 1d l nse tio Avree trongi A gree 
Slight Disagree Slight! ind Disagre« trongly. The 
sir ‘ rocedure of using only an Agree ar Di 
gree tion I each ite va ed in tl t be 
cause tl r ted the influ e of another rest £ 
set 1 wi f to g to extre es i re tering 
AgTe? ent or disa et ent 


rABLE 1 
CORRELATIONS OF A,;, As, Pi, AND P 


N 143 


A, + .30T +12 + .18* 04 
A + 327 O5 +. 15° 
P; + 53t +. 16* 


t Reliability coefficients based on Kuder-Richardson Formula 


The Ay and Py scales were administered to a 
group of 143 undergraduate college students 
(approximately 90 per cent female) who responded 
anonymously. Both scales were administered in a 
single session, the Py scale immediately following 
the Ay scale The re ponses given by the subjects 


were summarized in the following score 


Test Way of Obtaining Score 
A, Number of “Yes” responses 
Anality score \ Number of “No” responses 


Ay Sum of above two scores 
P, Number of “Agree” responses 
Political Ag . 

P; Number of Disagree re 

gression 
sponses 

COre > 

Py um otf above two scores 


Acquiescence Ay Number of “Yes” responses 


ores Py Number of Agree” responses 


On the assumption that anality and political 
aggression are more closely related than a response 
et to acquiesce, the correlation of Ay and P, 
hould be higher vhen cored for anality and 


political aggression than when scored for acqui 


escence The actual correlatior are a follows 

Ar V Py, (scored for anality and aggression) 
r +.O9 (not significant) 

Ay V Py cored lor acquiescence r ; 27 


ignificant at the .01 level) 


These correlations suggest that the reported 
anality-aggression relation hip l a tunction ol 
acquiescence Further evidence to upport this 
view appears in the intercorrelations of A;, A», Py, 
and P, a pre ented in Table 1. Both the correla 
tion of A; with P,, and Az with P2 reach signifi 
cance, though Farber reported i higher correlation 
between A, and P,. Since these correlation nvoive 
test vhose iten are scored in the same direction, 
it is impossible to tell whether they are a function 


of acquiescence or an anality-aggression relation 


hip. This ambiguity not present in the correla 
tions ol Ay with P, and A. with P ince the items 
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of the tests involved in these correlations are 
The obtained non 


that the 


cored i opposite direction 


ignificant correlation uggest relation 


ship between anality and political aggression is 


dependent on keying items in the same direction 


These results clearly suggest that a response set 


to acquiesce, not the reported anality-aggression 


relationship, adequately accounts for Farber’s 


finding 4 possible objection can be raised, how 


ever, to the scales developed by the author as 


inadequate measures of anality and _ political 
agyre ion The low correlations between A, and 
A,. and P, and P, indicate that Farber’s scales and 
those developed in this study are not measuring 


attributes. The most obvious difference 


between the re 


the ume 


pective scales is that acquiescent 


tendencie are weighted differently in Farber’s 


and the author’s tests. The reliabilities reported in 


Table 1 indicate that Ag and A, are about equally 
homogeneous. The author’s test of anality is thus 
as internally consistent as is Farber’s scale. How- 
ever, since P, appears less reliable than P,, Farber’s 


ion seem 
, the total 
political 


test of pol tical aggres the more adequate 


measure. Neverthele cale when scored 


for anality and aggression are fairly 


RABINOWITZ 


homogeneous, as evidenced by their Kuder 
Richardson reliability coefficients (Formula #20): 


Ar...f = 53 
Pr...7 = 47 


These reliabilities are high enough to permit a 
correlation of +.50 between Ay and Py. The failure 
of the obtained correlation to approach this maxi- 
mum cannot be attributed to unreliability of Ar 
and Py due to the items added in this study 
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